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This issue of The Bell System Technical Journal is devoted 
to a selection of articles dealing with various phases of math- 
ematical statistics and quality control. The Editorial Board 
_ and Editorial Staff of the Journal present this “‘all statistics” 
issue in the belief that the growing importance of statistics 
to communication technology warrants the simultaneous pub- 
lication of these articles. 


The Editors are pleased to include in this series of papers on statistical 
subjects one by Dr. Walter A. Shewhart whose pioneering work in statistical 
quality control has served as an impetus to wider use of statistical methods 
in the Bell System. This paper, which dates back to 1935, was one of a series 
of internal technical memoranda of the Quality Assurance Department of 
the Bell Telephone Laboratories, Inc. It was prepared by Dr. Shewhart in 
the course of a serves of departmental growp discussions having to do with 
the development of the fundamental philosophies of quality control and 
quality assurance. 


Nature and Origin of Standards of Quality 


By W. A. SHEWHART 
(Manuscript received September 25, 1957) 


This paper discusses the importance, from the viewpoint of judging 
quality, of: the end to be served by a standard of quality; the nature of the 
accepted binding force of the standard upon the acts of those interested in 
the standard; and the role of the judge of quality in shaping the standard in 
terms of natural law, authority, specification, custom, and precedent. 


1 


2 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1958 


I. OBJECT 


The control of quality of manufactured product involves three co- 
ordinate functional steps: the specification of the aimed-at standard of 
quality; the production of pieces of product that will be of standard 
quality; and the determination of whether or not product thus made is 
of standard quality. These three steps are respectively legislative, execu- 
tive, and judicial in character. The object of this paper is to consider the 
nature and origin of standards of quality from the viewpoint of judging 
the quality of product. 

Such a judgment as herein considered is made the basis of one or the 

other of two kinds of action: (1) the acceptance or rejection of a piece 
of a given kind of product for service; and (2) the adjudication of a 
complaint about the quality of a piece of product in service. The two 
judgments are of the type: J4 — this piece of product (or this lot of NV 
pieces of product) is (or is not) of standard quality, and J, — this piece 
of product (or this lot of N pieces of product) was (or was not) of stand- 
ard quality. In either case, it should be noted that the judgment is 
rendered in respect to the quality of a piece of product that is already in 
existence at the time the judgment is rendered — it is a judgment after 
the act of specifying and after the act of making the piece of product in 
question. This problem of judging the quality of a piece of product after 
it is made is definitely different from the legislative problem of specify- 
ing prior to the making of a piece of product what its quality should be 
in the light of information then available; and different from the co- 
ordinate executive problem of making a piece of product that will have 
the standard quality. 

Judgment, in the sense here used, implies a comparison of the quality 
of a piece of the given kind of product at some particular time with the 
standard for the piece at that time in the light of the evidence then 
available. If it were possible to specify completely and in an opera- 
tionally definite and verifiable sense the standard of quality for things 
of a given kind, and if it were possible to specify the operational tech- 
nique that would determine with certainty whether or not the quality 
of a given thing was that specified, the problem of judging would be 
routine in nature. But neither of these operations is possible. Hence in 
judging the quality of product, we must take account of the fact that a 
standard cannot be specified in this rigorous sense and that the practical 
standard of quality is determined not alone by written specifications of 
the quality characteristics prior to the making of a particular piece of 
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product but also by natural law, authority, custom, and precedent, 
existing at the time the particular piece of product is being judged. In 
other words, the quality judge is not, as it were, handed a standard of 
quality already made with which to compare the quality of a given piece 
of product. Instead he is only handed the stones with which to build 
such a standard. Through his interpretation of specifications, custom, 
precedent, natural law, and authority, the quality judge in a sense 
gives operational meaning to the standard of quality in much the same 
way that a judge gives operational meaning to the law of the land, 
whether it be statutes, custom, precedent, or constitution. 

Obviously, therefore, before a quality judge may render a judgment 
of either type J, or Jz, he must “determine” the standard that is to 
be used. But what is there to guide such determination? It goes without 
saying that he is not free to act as he pleases. In what follows we shall 
see how the acts of the quality judge in determining the standard depend 
upon: (a) the intent of the standard; (b) the nature of the binding force 
that the standard is presumed to have upon those concerned; and (c) 
the available source or sources from which a standard must be derived. 

To begin with, we shall consider the nature of a standard of quality 
as a means to an end, as this will give us a background for considering 
in turn the binding or constraining force of a standard upon the acts of 
those making use of it and then the origin of a standard in natural law, 
authority, specifications, custom, and precedent. 


Il. STANDARD AS MEANS TO AN END 


Dr. Gaillard of the American Standards Association defined a standard 
as: “‘A formulation established verbally, in writing or by any other 
graphical method, or by means of a model, sample or other physical 
means of representation, to serve during a certain period of time for 
defining, designating, or specifying certain features of a unit or basis of 
measurement, a physical object, an action, a process, a method, a prac- 
tice, a capacity, a function, a performance, a measure, an arrangement, 
a condition, a duty, a right, a responsibility, a behavior, an attitude, a 
concept, or a conception.” 

This definition stresses one important characteristic which is com- 
monly attributed to a standard, namely, that it is something fixed. The 
definition of standard here is very broad indeed; it would seem to include 
the rules of mathematics and formal logic, the rules of syntax of a 
language, and even legal statutes. In fact, it also includes social mores 
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and customs. All of these in some way or other satisfy the condition of 
being established for a time in one or the other of the ways specified in 
the definition. In fact, social intercourse is made possible only through 
standards in this broad sense. 

Perhaps the earliest conceived end to be attained in the process of 
standardization was that of attaining a certain more or less fixed order 
so that one might know, as it were, where one was at any time. We 
needed to have from the very beginning of social experience a more or 
less fixed meaning to symbols or words. In order to avoid utter confusion 
it was early recognized to be necessary to maintain a certain status quo. 
Typical statements of objectives expressing this end found in the litera- 
ture of standardization are to stabilize production, eliminate purely 
traditional practices, eliminate indecision in production and distribu- 
tion, place competition on the basis of essentials, protect buyers, and de- 
crease litigation. A standard from this viewpoint is comparatively fixed 
and judicial interpretation of the standard must stress the significance 
of the past. It is an instance of the type, “If it is good enough for father 
it is good enough for me’’. 

The first national standardizing society was formed in 1901 and from 
then to 1935 twenty-four others had been formed, most of them shortly 
after World War 1. The literature on the subject would seem to indicate 
that the most influential objective at that time was economic in charac- 
ter. Witness, for example, such objectives as: to decrease indirect ex- 
pense, reduce expense through decreasing the variety of necessary tools, 
reduce investment, increase output of workers, make selling easier, make 
possible more efficient and more economic design. 

As soon, however, as engineers began to stress the economic advan- 
tages of establishing standards of quality, they sensed the necessity of 
allowing for a comparatively rapid change in factors affecting the 
economies to be attained through a fixed standard. To secure these 
economies it was necessary to maintain a certain degree of fixity but at 
the same time it was also necessary to allow for changes. This was par- 
ticularly true in the light of the rapid development in production 
processes and types of material, where the economic situation might 
change over night. As a result, judicial interpretation of standards es- 
tablished for the purpose of effecting economies had to lay more em- 
phasis on discretion and less on the fixity of the rule or letter of the 
written specification than it did when the standard was primarily con- 
sidered as a means of maintaining the status quo. 

It was not long, however, after this comparatively recent beginning 
of the expansion of the functions of standards of quality that other than 
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economic advantages began to be stressed in the literature in such state- 
ments of objectives, for example, as: to stimulate research and develop- 
ment by bringing out the need of new facts in order to determine what is 
best and to make possible a higher average standard of living. Such 
expressions are but symptomatic of the recognition given by engineers 
in recent decades to a broader interpretation of the social objectives of 
mass production; of the attention being given to the problem of har- 
monizing and satisfying the many wants and interests of a group rather 
than laying emphasis upon one want, such as maintaining the status quo 
or securing economies of production, to the practical exclusion of others. 
Within recent years we are coming more and more to think of standards 
as a means for harmonizing and satisfying the greatest number of human 
wants in a given group at a minimum of cost. Certain companies, for 
example, operate under an expressed policy of attempting to give a 
quality of product which is adequate, satisfactory, dependable, and 
economic from the viewpoint of all concerned. In the attainment of 
such a policy it is obviously necessary to have the advantage of the es- 
tablished order provided by standards and the economies that accrue 
through standardization but the advantages of order and economy must 
be considered in the light of other wants. 

Whereas it was necessary in order to attain the economic end to lay 
greater stress on discretion and less on the fixity of a standard than it 
was in order to maintain the status quo, so likewise in order to attain 
the harmonization and maximum satisfaction of human wants imposed 
by many production policies of today, we need give further emphasis to 
discretion and still less to the letter of a previously established standard. 
In other words, with the growth of the end to be attained by quality 
standards from the simple maintenance of order and the status quo to 
economies of production and finally to a quality that is satisfactory, 
adequate, dependable, and economic, there has been a corresponding 
decrease in the stability of the end which must be given due weight in 
the judicial interpretation of standards of quality. Not only must allow- 
ance be made for the rapid development of new processes of production 
and the development of new materials, but also for the rapid develop- 
ment and change in the wants of any given group. 

As a basis for our further consideration of the nature of the binding 
force of a standard and later for our consideration of the origin of a 
standard in natural law, authority, specification, custom, and precedent, 
we shall assume that a standard of quality for a given, kind of thing 
consists of the magnitudes of those quality characteristics of the thing 
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which are necessary and sufficient in order that the quality of the thing 
shall be satisfactory, adequate, dependable, and economic from the 
viewpoint of those concerned with the standard. 


Ili. THE BINDING FORCE OF A STANDARD 


The existence of a standard of quality for a given kind of thing im- 
plies a certain responsibility on the part of producer and consumer to 
abide by it. If this were not so, the standard would have little practical 
significance. To begin with, it is desirable to distinguish between the 
way one is constrained to act by the force exerted by some external 
agency, whether human or otherwise, independent of one’s own experi- 
enced interest or volition, and the way one is constrained to act in ac- 
cord with one’s own interest and volition in the light of one’s under- 
standing, knowledge, and belief about the external world. In the one 
case the binding exists even though it be not willfully and gracefully 
accepted and in the other case the binding exists only when accepted. An 
example will help to clarify the distinction. If there really exists a natural 
law corresponding to the first and second laws of thermodynamics then 
a perpetual motion machine is impossible no matter how much we will 
or desire to have one. The fact is that we do not know with certainty 
that these laws are really laws of nature. The degree to which we will- 
fully bind our acts in accord with these laws depends upon our degree 
of belief in the laws. There are, of course, some who every year try to 
patent perpetual motion machines. The following discussion is limited 
to a consideration of the willed or accepted binding. There are at least 
four kinds of accepted binding force to which attention need be given in 
the consideration of standards from the viewpoint of judging quality: 
(a) natural law, (b) authority, (c) individual interest, and (d) group 
interest. 

Perhaps all of us have a certain belief in the fixity, order, and uniform- 
ity of the external world in which we live. We customarily admit the 
limitations of our human efforts to ward off death indefinitely, to build 
a perpetual motion machine, and to do a multitude of other things. We 
are bound, as we say, to follow the “laws of nature” be they physical, 
psychological, economic, or any other. In the next section, we shall 
consider the ways in which belief in natural law binds us in certain ways 
to conform to certain standards. We shall observe, however, the impor- 
tance of the fact that the man-made nature of natural laws as stated has 
the practical effect of directing our attention away from an objective 
nature to the knowledge of nature possessed by the individual and the 
knowledge common to the group. Natural law is not something sticking 
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out in the world that each and every one of us perceives with ease and 
certainty. Hence one cannot “‘prove’’ to one or more others the binding 
of a natural law in standardization until the others are convinced of the 
existence of such a law and then only to the degree of this belief or con- 
viction. In rendering a judgment about quality in respect to a standard 
the quality judge must therefore take into account that it is not the cita- 
tion of a natural law that is binding upon willful or interested acts but 
the belief in this law on the part of those bound and that this belief may 
be either rational or irrational or a combination of the two. In other 
words, the accepted bindingness of cited natural law upon an individual 
or group depends upon the belief in the validity of such law on the part 
of the individual and group. 

Let us now consider the binding force of authority sanctioned by so- 
ciety at large. Such authority may be legal, potentially legal, or simply 
that of some institution of society, such as a church. Assume for the 
moment that there existed a sovereign power in the group interested in 
a given standard such that a specification issued by this soverign agent 
either could be made binding by force upon every member of the group, 
or was accepted as binding by every member of the group. Obviously 
the judicial findings of the quality judge would in the end have to be 
acceptable to such a sovereign power. Judicial interpretation of a 
standard in terms of specifications, custom, and the like, would therefore 
have to be made in such a way as to be acceptable ultimately to the 
sovereign authority. To be more explicit, suppose there existed a group 
such as a national or international standardization body that had sov- 
ereign authority to fix standards. The viewpoint of the quality judge 
would always have to be directed toward the intent of the standard as 
fixed by such a group in much the same way as a judge in a legal dispute 
in certain political systems must look to a king or parliamentary body 
for ultimate approval and sanction of his findings. But today there 
perhaps exists no group having sovereign authority in this sense in the 
case of quality standardization in mass production. In fact, even in 
political systems acceptance of sovereign authority as such would appear 
to be on the wane. The significant point to be noted for our present pur- 
pose is that the ultimate source of accepted authority is the group in- 
terest or, perhaps better, the common interests of the group. For ex- 
ample, it may be argued that insofar as the laws of a political unit or 
state are concerned today with the attainment of the harmonization and 
satisfaction of human wants, standards set in this way should be ex- 
pected to be pretty much the same as those determined by the interests 
of the group —the ultimate sanction of the standard is not in the 
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penalties laid down by laws nearly so much as it is in the evolving habits, 
opinions, interests, and emotions of the members of the group. 

It may be justly argued that the last three aspects of constraint are 
hopelessly confused one with another in any given case. For example, it 
may be argued that one’s personal interest, whether he be producer or 
consumer, in holding to a standard with the expressed intent of bringing 
satisfaction to others is in fact sanctioning the standard simply because 
he believes that this is the only way in which he himself may obtain 
certain advantages. Likewise, it is to be expected that a producer must 
give attention to ways and means of satisfying the wants of others if he 
is to maintain the good-will so necessary in disposing of his goods. In 
turn, it is to be expected that the consumer must in certain instances be 
satisfied with taking something that the majority wants in order to 
attain the economic advantages of mass production. Why then is it 
desirable from the viewpoint of judging quality to differentiate between 
these sources of binding force, and particularly between that based on 
individual interest and that based on group interest? 

At least one important reason is not hard to see. The individual 
interest is, broadly speaking, the activating element in bringing about a 
more or less continual change and improvement in a given standard, 
whereas the group interest serves as a constraint under which individual 
interest must operate. Individual interest helps to keep standardization 
from falling into a rut: helps in guiding the action which results in stand- 
ards of tomorrow which give greater satisfaction and harmonization of 
wants than do the standards of today. But on the whole standards of 
today are physically far more complicated and involved than were those 
of yesterday. Progress depends to a large extent upon the element of 
change and in order to bring about this acceptable change the producer, 
for example, may be bound to hold more rigorously and to a more in- 
volved standard than would at a given time be demanded by the wants 
of those to whom he wishes to bring satisfaction or by the authority 
sanctioned by the group. 


IV. ORIGINS OF STANDARDS OF QUALITY 


As a starting point, it will be helpful to consider briefly the meaning 
of quality as it will be made the basis of what follows. We shall dis- 
tinguish among the following three types:* Type I— Those quality 
characteristics which make a thing what it is independent of human 
interest or volition; Type II — Those quality characteristics which 


* These types of quality were discussed in more detail in a paper, Some Aspects 
of Quality Control, Mechanical Engineering, December, 1934. 
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characterize a thing A in relation to another thing B as a part to a 
whole and independent of human interest or volition; Type III — Those 
quality characteristics which make a thing wantable by some one or 
more persons. 

It is necessary to note that the quality of a thing in this sense is that 
which characterizes the thing throughout its life. In what sense then can 
we speak of the quality of a thing, such, for example, as a relay, fountain 
pen, or any manufactured article, at any stated time? Thus what does 
it mean to say that my fountain pen is one that operates easily, gives a 
steady flow of ink, and does not scratch the paper? In more general 
terms, what does it mean to say that the quality of a thing is such and 
such? In the case of my fountain pen, my judgment that ‘‘it is one that, 
etc.,” is based upon my past experience that it has under certain condi- 
tions and at some time or times in the past operated easily, given a 
steady flow of ink, and not scratched the paper. I imply by such judg- 
ment that under certain conditions and at some time or times in the 
future my pen, if used, will operate easily, and so on. In much this sense, 
the judgment that the quality of any thing 7s such and such is from a 
practical viewpoint equivalent to a judgment that it will be such and 
such. Moreover, such a judgment is based upon certain evidence ob- 
tained through certain operations on the thing or similar things in the 
past and implies that certain experience will result if certain operations 
are carried out on the thing in the future. From this viewpoint, the 
understanding of a statement or judgment about the quality of a thing 
necessitates that we treat it as a probable inference about potential 
experience that may be expected if one operates on the thing in certain 
ways and when each inference is based upon evidence or experience de- 
rived from previous operations on this thing or similar ones. Hence we 
shall consider first the origin of standards of quality in natural law or the 
uniformities of nature believed in as relating past to future experience. 


4.1 Natural Law 


Without the existence of uniformity in nature there could be no stand- 
ards. However, a fundamental characteristic of natural law of immediate 
interest is that to a certain extent it is man-made and is not certain, that 
is to say, the formulation of natural laws depends upon a priori concepts 
as well as the experiences of human beings. Such laws in practice are but 
approximations to the laws or uniformities which scientists tacitly as- 
sume to exist. It follows that the statement of a law amounts to a prob- 
able inference based upon certain specific evidence and as such is open 
to revision on at least two counts: (a) the acquisition of additional per- 
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tinent evidence, and (b) the discovery of an error in the formal proce- 
dures constituting a part of the generally accepted inductive processes 
of arriving at a probable inference. 

The evidence upon which laws fix boundaries to attainable qualities 
of the first two types is that derived from the natural sciences, particu- 
larly physics and chemistry, whereas the corresponding laws determining 
the wantableness of a thing are derived not only from a study of physics 
and chemistry but also from a study of psychology, physiology, and 
other sciences, involving the human element. 

From the viewpoint of judging quality, we must, therefore, consider 
two consequences of the accepted fact that our knowledge of natural 
law is probable only: (a) in the process of collecting evidence the quality 
judge discovers certain aspects of natural law which effectively shape 
the objective standard of quality, and (b) the quality judge must allow 
for the fact that the accepted binding force of man-made approximation 
to natural law depends upon the belief in the validity of the law. This 
is particularly true in the case of economic and social “laws”, for ex- 
ample, although the same is true to a lesser degree in the case of other 
natural laws. 

Now, in accepting the policy of attempting to make a thing whose qual- 
ity is satisfactory, adequate, dependable, and economic, we tacitly as- 
sume that there is a set of, let us say, ms quality characteristics of magni- 
tudes Z1, Z2, ++: Zi, +++ Zms, Characterizing the objective standard 
which would harmonize and maximize the satisfaction of the wants of 
a given group of people. It is this set of values of these characteristics 
which is assumed to be fixed by real natural law and which it is the aim of 
research, development, and design to be able to specify prior to the 
start of production. Obviously, however, such a set of ideal characteris- 
tics can only be approached as a goal. Quantity production cannot wait 
until such a set has been discovered and made a part of a specification 
because to do so would mean that production in every case would have 
to be postponed indefinitely. This is true not only because of the un- 
certainty of our knowledge of effecting the best means to that end 
but also because research, development, and design engineers find it 
necessary to introduce modifications in their original specifications as 
new evidence is brought to light through research in the natural and 
social sciences. 

The point now to be considered, however, is that such needs for 
changes in specifications and accepted standards of quality arise not 
only from the evidence obtained in research proper but also from that 
uncovered in the process of judging quality in the light of evidence ac- 
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cumulated in the process of production and inspection and in the adju- 
dication of complaints. It goes without saying, perhaps, that one of the 
very practical ways of detecting the failure of an apparently accepted 
standard of quality to satisfy the one who makes use of the product is the 
evidence brought to light in the study of complaints. There is, however, 
a more important although perhaps less generally recognized limitation 
imposed by natural law which falls pretty much within the domain of 
the quality judge to translate into requirements of a standard of quality. 
We cannot make things identically alike, one with another, in respect to 
any given characteristic; presumably there is a limit to which we may 
hope to go in removing the causes of variability of a given product made 
by a given process. In other words, there is a limit beyond which we 
perhaps cannot hope to go in controlling variability. There is also 
another limit of interest in the control of variability, namely, the limit 
beyond which it is not economic to go in a given case. These two limits. 
of course, are for the most part independent of interest or volition. There 
is, however, also a third limit which is of importance and this depends 
upon the human element, namely, the limit to the range of variation that 
will in any way influence the wantability of a thing. 

For the most part, evidence which is pertinent to the determination of 
the first two kinds of limits arises in the production and inspection of 
product — in other words, comes from the data which the judge of qual- 
ity must accumulate in the process of judging quality. It is but reason- 
able, therefore, that the quality judge should make use of such data not 
only for the purpose of judging the particular case at hand but also for 
the purpose of helping to establish the economic or desirable degree of 
control of the variation in quality about the aimed-at value. Again, in 
the study of causes of complaint, information may be obtained which 
may be valuable in helping to fix the limit of variation in the physical 
quality of a thing from the viewpoint of its wantableness, particularly 
as evidenced by sensory experience. In general, of course, wantableness, 
particularly in the case of new product, is often not dependent upon 
what the trained experimental psychologist or physicist fixes as the 
minimum detectable difference. For a time at least a much wider varia- 
tion will pass unnoticed from the viewpoint of wantableness and hence 
from the viewpoint of a user of a thing it would not be economically 
desirable to control the limits of variation within a narrower range than 
that thus determined. 

Now, let us pass to a consideration of the nature of the binding force 
of man-made natural law upon all concerned with a given standard. Just 
as we noted in the beginning that it is likely that everyone would admit 
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the binding of what is accepted as natural law, it is also reasonable to 
believe that the majority, at least, would agree as to what is to be ac- 
cepted, if they had common training and had given due consideration to 
all evidence upon which a given law is based. Obviously, however, it is 
not feasible, in general, to attain a binding upon the basis of a common 
understanding and interpretation of all pertinent data. Such binding 
as does exist perhaps must arise upon the acceptance of those who make 
or state the man-made natural laws as technical authorities. More often, 
however, it is likely that the binding force of a standard fixed in terms of 
natural law upon the majority will be in terms of the reasonableness of 
such a standard in the light of more or less common evidence pertinent 
to the standard rather than upon all of the evidence which an experienced 
technical authority might make the basis of his decision. Hence, it may 
come about that the common acceptance of judicial findings will depend 
more upon the proper choice of the simpler facts than upon what the 
trained specialist would consider to be the more weighty although more 
complicated evidence. It is important, however, for the quality judge 
to keep in mind that what would be acceptable today on such a score is 
likely not to be acceptable tomorrow when the common store of perti- 
nent evidence is greater. Hence in a certain sense the quality judge, if 
he is to allow for growth, is bound to consider the whole of the evidence 
even though for acceptance by the majority at the moment he need 
consider only a part. 


4.2 Authority 


In general the group of people interested in any standard of quality 
constitutes a part of a larger group bound together in some political unit, 
federal, state, or otherwise. Each such political unit has its laws and rules 
of action, some of which apply directly, and others indirectly, to stand- 
ardization. In other words, legislation by duly constituted bodies, both 
political and otherwise, has in many countries and over a long period of 
time constituted one source of standards of quality. As in our con- 
sideration of natural law as a source we chose the viewpoint of a quality 
judge, so here too will the same choice be made. That is to say, no con- 
sideration will be given to a critical appraisal of such a source except 
as may be helpful in judging quality under conditions where a legal 
standard is provided. 

To begin with, let us consider the significance of the generally accepted 
binding force of law in the political unit fixing a standard of quality. Take 
first the case where authority is accepted as a consequence of a postu- 
lated divine right of a monarch to make law. The judgment of quality 
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under such a condition would in the last analysis have to be acceptable 
to the monarch or his duly appointed agent. Under such conditions the 
accepted binding force of natural law would even be secondary to that 
of the law of the monarch. On the other hand, if we consider law not as 
an end in itself originating from a source on high whose authority is 
accepted without question but as a means to an end, the whole aspect of 
the problem of judging quality is changed. For example, if we choose 
the sociological end of law, current since the beginning of the twentieth 
century, expressed by Dean Pound of the Harvard Law School as the 
attainment of harmonization and maximization of the satisfaction of 
human wants, then the aimed-at standard under law would be that 
fixed by natural law in much the way we have already considered. 
Whereas a legal standard under the concept of divine right or other 
sovereign power must be that acceptable to the monarch or legislative 
authority, a legal standard under the sociological concept of the end 
of law expressed by Dean Pound is one acceptable only in so far as it is 
rational or reasonable in the light of natural law as understood at the time 
by the majority of those concerned with the standard. 

Omitting any discussion here as to whether or not it is feasible or even 
desirable, if feasible, that legal statutes fix standards of quality, let us 
accept the fact that attempts have been made so to fix them. There are 
at least four important reasons why it is exceedingly difficult to fix a 
standard by statute that might not later be judged in a legal court as 
unreasonable, assuming, of course, the sociological end of law: (a) In 
the first place, it is not feasible to discover with certainty those quality 
characteristics (the Z’s of the previous section) which would charac- 
terize the ideal goal. Hence new results of research obtained after the 
statute was passed might give rise to a need for change in the statute; 
(b) Even though it were possible thus to specify the ideal quality, it still 
remains an exceedingly difficult and uncertain task (as will be exempli- 
fied in future sections) to specify, at the time the statute is passed, 
definite and verifiable inspection procedures that will give a detailed 
description of the conditions under which acceptance of product is for- 
bidden or allowed; (c) It would be practically impossible to establish at 
the beginning and once for all the economic tolerances that must be 
allowed because of the existence of unknown causes of variation which 
must be left to chance; and (d) Because wants as well as means of satis- 
fying wants are constantly changing, there is a continual need for con- 
sidering the desirability of changing a standard. 

An important point to be considered in lay judging of quality in re- 
spect to conformance with legal standards where they do exist is that 
there is great room for legal interpretation of any such statute particu- 


14 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1958 


larly if the sociological end of law is the one accepted. In fact, such a 
statute is likely to be pretty much what the legal judge interprets it to 
be in the light of available evidence. Hence the lay judge of quality must 
often look to court decisions and interpretations for guidance in giving 
definite operational meaning to a legal standard of quality when it does 
exist, in order that his decision may be in terms of the standard as it 
would be interpreted by ‘the legal judge. In so doing, however, the 
quality judge effectively helps in shaping the standard of quality. 


4.3 Specification 


Since 1900 and particularly since the end of World War 1, the most 
important source of standards of quality has been that of specifications 
made and approved by industrial groups. Within this short period of 
time, many national and international standardizing bodies have been 
formed. Throughout this period there has been a rapid growth in the 
development and use of specifications by separate organizations. In 
this section we shall consider briefly certain characteristics of a specifica- 
tion of particular interest from the viewpoint of judging quality. In par- 
ticular, we shall consider the important part played by a quality judge in 
giving effective meaning to a specification and shall consider the sources 
to which a judge of quality should look in order to determine the binding 
of a specification. 

First let us consider what it means to specify the quality of a thing in 
general. For convenience let us assume that for a given thing there are m 
quality characteristics, X;, X2,-°+:Xi,--+ Xm,, of the first type; me 
quality characteristics, Y1, Ye, --- Yi, --: Ym,, of the second type; 
and ms; quality characteristics, 271, Z2,°-: Z:, +: Zm3, of the third 
type. In order to give operationally verifiable meaning to any one of 
these quality characteristics, it is necessary to specify four kinds of 
operations and in addition to specify the limits within which the ob- 
served results of these four operations should lie. Thus for X; we have: 

Sx, = 1. Specify the method of perceiving or measuring the quality 

characteristic. 

2. Specify the number n of repetitions to be made of the opera- 
tion of perceiving or measuring under the same essential 
conditions. 

3. Specify who is to perceive or measure. 

4. Specify the method of analyzing the results of the n repe- 
titions. 

5. Specify the limits within which the observed results of the 
previous four operations should lie. 
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Similar expressions hold for any Y; or Z;. As previously noted, the ideal 
specification of the quality of a thing would state the necessary and 
sufficient verifiable operations to describe that quality which is satis- 
factory, adequate, dependable, and economic. Such a quality charac- 
terization obviously extends throughout the life of the thing. Funda- 
mentally, this would require an indefinitely extended specification even 
if we assume a comparatively small number of quality characteristics 
to be necessary and sufficient, and one that could not be verified except 
by examining or operating on the thing throughout its life in a prescribed 
manner. In practice, therefore, it is customary to specify certain require- 
ments which a thing is supposed to meet up to the time that it goes into 
service. Of course, some or all of the specified X’s and Y’s might be those 
in which the consumer interest centers but in general they are not those 
directly sensed or experienced by the consumer but rather those which 
serve to characterize a thing physically. Witness, for example, the dif- 
ference between those quality characteristics used in advertising an arti- 
cle and those used in technical specifications. 

An important fact to keep in mind is that even though the quality of a 
thing satisfies the technical specifications of the form Sx, and Sy, , 
one cannot be certain that this quality will be such that it will prove 
satisfactory, adequate, dependable, and economic. All that an engineer 
can do in preparing a set of specifications is to choose that set which upon 
the basis of his interpretation (or the interpretation of a certain group) of 
the scientific evidence available to him (or to the group) at the time, 
constitutes grounds for a high degree of rational belief or probability 
that the quality of things which satisfy these requirements will also be 
found to be satisfactory in service. In other words, two factors must be 
considered, one the evidence F available at the time, and the other the 
degree of belief or probability based upon such evidence. Obviously, 
this probability may change with increase in pertinent evidence which 
may come about either through the process of continued research or 
through the information attained in the course of production and in- 
spection as well as in the analysis of complaints. It is therefore necessary 
to recognize that specifications adequate upon the basis of evidence 
available at the time that they were written may not be similarly ade- 
quate upon the basis of information made available in the course of pro- 
duction, inspection, and use. Under these conditions, there are at least 
five ways in which the quality judge effectively plays an important réle 
in helping to fix or shape standards of quality: 

(a) In judging whether or not a piece of product shall be accepted 
or rejected, the quality judge must take into account, in accord with the 
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assumed policy objective of production, any evidence which may have 
come to hand, particularly in the processes of production, inspection, and 
analysis of complaints, indicating the present specifications to be incom- 
plete in that they do not include requirements on certain variables 
which it seems desirable to control. Quite naturally such requirements 
will sooner or later find their way into specifications, but the quality 
judge must, insofar as possible, act in accord with what he considers to 
be potential changes if the policy of accepting only quality that may 
reasonably be expected to be satisfactory, adequate, dependable, and 
economic is to be met. In other words, the quality judge must fill in the 
gaps in existing specifications in so far as new evidence obtained since 
such specifications were written would indicate to be reasonably de- 
sirable. 

(b) If the quality judge is to accept the theory that a specification is 
but a means to an end and is to take account of the fact that the justi- 
fication of a specification rests upon an ever-changing body of evidence, 
it is necessary for him to use discretion in judging quality of product to 
be either acceptable or rejectable upon the basis of specifications alone. 
In other words, certain non-conformance cases may arise in respect to 
specified quality characteristics which may have under certain condi- 
tions little effect upon the experienceable quality of such equipment in 
use. In such a case it may likely be uneconomical on the part of all con- 
cerned to reject such product. Such action on the part of the quality 
judge is not, as it were, ignoring a specification but rather making a judg- 
ment upon evidence which was not available at the time the specifica- 
tion was written. 

(c) If any one of the four items in Sx, and Sy, are omitted in the 
written specification, it is necessary that such be supplied by the quality 
judge. For example, specifications sometimes simply state that some 
quality characteristic such as mass, length, capacity, resistance, or the 
like, shall lie within certain limits. Such a specification is incomplete 
upon the basis of the first four counts: it does not specify the method of 
measurement; the number of repetitions of the measurement; the one 
who is to make the measurements; nor the function of the measurements 
that are to be within the set limits. Another typical failure of a specifica- 
tion to be definite is illustrated by each of the following requirements 
taken from actual specifications: ‘‘The zine alloy shall be 99.99 % pure’, 
“The spark gap shall be adjusted to 0.008 in.’”’. The quality judge here 
must supply not only the first four types of operations but also the 
missing limits! 

(d) In general, the specification engineer has in mind in writing his 
specification the limits within which the quality of a single piece of equip- 
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ment should lie if it is to be that which he believes will prove to be 
wanted. True enough, he is likely to give weight to the data constituting 
his previous experience of production methods which indicates limits 
within which variability may be expected under production. Obviously, 
however, such evidence is likely to be very meagre indeed as compared 
with the cumulative evidence obtained after production starts. Ex- 
perience shows that there is an economic limit to the allowable varia- 
tion in the quality of product turned out in a given process. In other 
words, it is often found that it is more economical to discover and 
eliminate assignable causes of variation of quality than it is to leave 
these in the production process and reject that portion of the product 
that does not meet the required limits. The quality judge has an im- 
portant rédle to play in devising techniques which will indicate the 
presence of assignable causes and of using these in helping the production 
department to establish economic control limits which serve as standards 
for future production. 

(e) We now come to what is perhaps the most important réle of the 
judge of quality in giving operational meaning to a specification. Even 
though an operationally definite and verifiable meaning of quality is 
given in the specification, there are two reasons why it is often necessary 
to resort to sampling in order to determine whether or not quality 
meets the specification: (a) it is often uneconomical to give 100 per cent 
inspection, particularly where defective parts would be weeded out in 
final assembly or at the time of installation, and (b) it is often not 
feasible to give 100 per cent inspection because of the destructive nature 
of the method of verification of the quality, as, for example, in testing 
the tensile strength of materials and the blowing current for fuses. In 
such a case the quality judge must supply an inspection specification 
which will insure the following two things: (1) that a satisfactory amount 
of data or evidence will be accumulated upon which to render judgment 
as to the nature of the quality of the unsampled portion of the lot, and 
(2) that an operation will be indicated to determine whether or not it 
shoud be rejected whenever the degree of belief in the satisfactoriness 
of the unsampled portion of the lot upon the basis of evidence thus ac- 
cumulated is insufficient to justify the acceptance of the lot. The ques- 
tion, How much data?, depends in general upon the degree of economic 
control of quality previously obtained and hence the inspection operation 
specified must be such that it keeps abreast of the continual supply of 
information obtained in the process of inspecting product if such an 
operation is to give adequate assurance of quality at a minimum of cost. 

We are now in a position to turn our attention to a consideration of the 
nature of the binding force of specification. In the first place, a specifica- 
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tion may be made the basis of a contractual agreement between two 
parties, in which case it takes on certain legal as well as moral binding 
force characteristic of a contract. One of the conditions usually assumed 
for the validity of a contract is that the two parties to the contract be 
cognizant of the contents thereof. Of course, in many instances specifi- 
cations of quality are extremely involved from a scientific and engineer- 
ing viewpoint and hence it is to be expected that parties to a contractual 
agreement involving highly technical specifications of quality must be 
capable of arriving at a common meaning of such specifications. This 
limits the field in which technical specifications may be made the basis of 
valid contracts. The second source of binding is, of course, the require- 
ment that the quality accepted as meeting the specifications be judged 
in the end as satisfactory by those making use of the product. In this 
case, however, we should note that the binding force is not so much that 
requiring that the quality of product meet the specifications as it is that 
requiring that the quality be found in the end to be satisfactory by 
those making use of the product. In this case, however, it must not be 
overlooked that there is a growing tendency on the part of the majority 
of users of most kinds of goods to place reliance upon the judgment of 
men or groups of men whom they accept as being technical authorities, 
such, for example, as national or international standardizing commit- 
tees. 

In the third place, as previously noted, a producer is sometimes bound 
because of his own future interests to adhere to a specification even when 
such adherence would not be demanded at the time by those whose 
wants the quality is supposed to satisfy. For example, the appreciation 
of high quality often comes through experiencing high quality. One who 
has never heard what a technician would consider to be good music, 
good quality of radio transmission, good quality of telephone transmis- 
sion, or good quality of some musical instrument, might never have the 
desire to experience such. Progress, therefore, often comes by living 
up to a specification of quality even beyond the limits wanted by the 
majority of those concerned at a given time. Jn other words, the pro- 
ducer’s personal interest is often more binding than either or both the 
bindingness of a specification made a part of a contractural relation and 
the immediate interests of the consuming group, if he is to lead the way 
in evolving standards that will later be wanted by the majority. 


44 Custom 


All of us are more or less creatures of habit; all of us are more or less 
influenced throughout life by the habits and the common methods of 
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acting of those around us. We early learn that society always takes a 
revenge of one form or another for a breach of any of its common ways 
of acting and hence as members of any group we feel more or less bound 
to follow the conventions of that group. For example, in our methods and 
means of communicating one with another, we are bound to a large ex- 
tent to the customary use of symbols, either written or spoken. Even 
the meaning of a written specification of quality so far as the majority 
of a group or society is concerned inherently depends to a large extent 
upon the customary interpretation of words and other symbols used 
therein. It is to be expected that custom should play a part in the pro- 
duction of standards. Thus a long while before the development. of 
written specifications of standards of quality there existed unwritten 
standards, as it were, fixed by the customs of certain groups. At least 
the meaning of certain words was sufficiently common to members of a 
group to enable the interchange of goods. 

With the development of mass production practices first introduced in 
the eighteenth century, there has grown up an ever-increasing apprecia- 
tion of the economic advantages to be attained by securing a high degree 
of uniformity in the quality characteristics of a given kind of thing. It is 
significant for what follows that there exist at least three ways in which 
customary quality may differ from specified quality in such a way as to 
constitute a part of the standard which is inherently binding upon the 
group. 

In the first place, a given kind of product produced over an extended 
period of time in considerable quantities may exhibit a uniformity in 
quality characteristics not specifically expressed in the specifications of 
the form Sx, and Sy, . In the second place, one or more quality charac- 
teristics may be specified to have magnitudes lying within a definite 
range although experience has shown that over a certain period in the 
past in which many pieces of this kind have been produced the magni- 
tudes of the particular quality characteristics thus specified have differed 
from their specified values but in a way which has been acceptable from 
the viewpoint of use. For example, take the case where the production of 
a new kind of product is started in which the specification of one of the 
important quality characteristics, such as length of life, is that it shall 
not be less than some specified value. Let us assume that N pieces of this 
kind of product have been made and put into service and that the ex- 
perience thus obtained shows that the lengths of life of these N pieces of 
product have been distributed uniformly about an average length L 
considerably above the specified length S. Particularly if the number N 
of pieces of this kind of product that have gone into service is large and 
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if those making use of this kind of thing come to expect an average life 
of approximately L, even though the specification simply calls for a life 
not less than S, most producers would feel bound in certain ways to 
maintain a quality not assignably less than L. It is quite likely, to say 
the least, that some consumers of this kind of thing might feel justified in 
registering a complaint if they should find in the future that the length of 
life of this kind of thing was significantly lower than L even though it 
did not fall below S. In the third place, even though no specific mention 
is made of the fact that in the specification, users of a given kind of 
product may reasonably expect that observed variability in the quality 
characteristics specified should be no larger than that which for economic 
reasons should be left to chance. For example, consider the class of users 
of a given kind of thing such as an automobile. If we find upon compar- 
ing notes with our neighbors or others using the same make of car that 
ours differs from theirs in a way which we consider undesirable, we are 
likely to feel like registering a complaint. 

In rendering quality judgments the quality judge must take into ac- 
count at least these three ways in which custom may effectively consti- 
tute a part of the standard of quality binding in a given case. In fact, 
he not only must take into account custom in certain instances but in 
fact, as we have seen in the previous section, he must also in certain ways 
help in establishing custom, as, for example, in the analysis of results of 
inspection and the determination of economic limits of variability. 

The ultimate source of binding force in maintaining uniformity is quite 
naturally the consumer’s desire for uniformity. Such a common want, 
however, is in a certain sense potentially of legal binding in the sense that 
many statutes as well as common law: have their origin in custom. In 
any case, the degree of binding depends among other things upon: the 
available evidence of the existence of a custom; how long and how con- 
tinuously it has existed; whether or not the custom has been peaceably 
enjoyed; to what extent those affected have regarded it a duty to follow 
the custom; and whether or not the custom in question is consistent 
with all other accepted customs. 


4.5 Precedent 


- To begin with, it is desirable to clarify the distinction here made be- 
tween custom and precedent. Custom, as we have seen, is of the nature 
of an established practice that has more or less gradually come into 
existence. Precedent, on the other hand, arises in the judgment in re- 
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spect to the quality of product that has already been produced as to 
whether or not it is or was of standard quality. Precedent arises, in 
other words, in the finding of the quality judge. If it were feasible to 
write specifications of quality that were ideally necessary and sufficient 
for satisfaction, adequacy, and dependability at an economic cost, and 
if it were feasible to determine with certainty whether or not the quality 
of a given article met such specifications, there would be little, if any, 
occasion to consider the réle of precedent. Since, however, this is not 
feasible, there are three types of judicial findings which are important 
in quality control. , 

Cases of non-conformance with specified requirements are bound to 
arise where the information available at the time justifies the judge of 
quality in concluding that, under the specific conditions existing in the 
case, the quality, even though non-conforming, is acceptable. Likewise, 
conditions are bound to arise where, even though the quality of a given 
thing does conform to specifications, it may not be acceptable. This 
follows at once from the fact that we are not able to state the necessary 
and sufficient quality requirements. This class of precedent arises as a 
natural consequence of looking at a standard as a means to an end, rather 
than as an end in itself. 

Just as common law arises for the most part in the judicial recognition, 
interpretation, and formulation of custom, so also does the effective 
control of custom in standardization come about through the recogni- 
tion, interrelation, and formulation of custom on the part of the judge 
of quality. Thus judicial declarations or recognitions of the existence of a 
custom constitute another source of precedent. In quality control one 
of the very important examples is the judicial decision as to whether or 
not a custom has been established with regard to the degree of varia- 
bility which should be left to chance. 

A third source of precedent is interpretation: first, interpretation of 
the operational meaning of a standard even in so far as it is specified; 
second, interpretation of the sampling technique required in order to 
give adequate information upon which to render a judgment; and third, 
interpretation of the rules of judging and interpreting evidence as to the 
quality of product. 


Vv. CONCLUSION 


The practical meaning and significance of a standard of quality is 
largely determined by the end which it is supposed to serve in use and by 
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the nature and degree of the binding force or sanction accorded it by the 
group interested in or affected by the standard. The standard itself may 
originate in one or more of the five sources: natural law, authority, speci- 
fication, custom and precedent. In any case the judge of quality is not 
handed a standard ready made with which to compare the quality of 
any given kind of manufactured goods — instead he of necessity plays 
an important réle in shaping and determining the standard as derived 
from these sources. 


Contribution of Statistics to the Develop- 
ment Program of a Transformer for 


the L3 Carrier System 


By G. J. LEVENBACH 
(Manuscript received August 20, 1957) 


Statistical methods played a significant part in the development program 
of the L3 system. Experiments were designed to assist in improving the 
manufacture of the input and output transformers of the amplifiers. De- 
tatled analysis of a few of these experiments is presented. 


I. INTRODUCTION 


In previous issues of THE Brett System TECHNICAL JOURNAL the 
problems in design, development and manufacture that were encountered 
in building the L3 coaxial carrier system are described. This system 
provides 1,860 one-way telephone channels or 600 one-way telephone 
channels plus one TV channel over each coaxial tube. The L3 system is 
capable of transmitting a television signal over a distance of approxi- 
mately 1,000 miles and telephone signals, approximately 4,000 miles. 

From the start of the development program, statistical methods have 
played a significant part. Special acceptance procedures have been set up 
to assure that the shipped product would meet certain distribution re- 
quirements.! Control chart techniques were generously applied both in 
the manufacture of component parts and for subassemblies.? This paper 
gives in part a case history of one of the difficult components. The view- 
point is that of the experiments designed to overcome difficulties in the 
initiation of the manufacturing process and to explore possibilities of 
improvement of the component. 

A detailed discussion of the present manufacturing techniques of this 
component, the input and output transformer of the amplifier, has al- 
ready been presented by Earle.’ That paper will be used freely to pro- 
vide the technical details and pictures necessary for an understanding of 
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the experiments. No basically new statistical designs were employed in 
this development. The main interest lies in the fact that these experi- 
ments together with the engineering design and the manufacturing opera- 
tions, including the appropriate process controls and inspection tech- 
niques, were integrated in the development program. 

An endeavor is made in this paper to point out the logical link between 
the statistical analysis and the engineering consequences. Advantages 
of the use of statistical methods in experimental work are as follows: 

1. In designing an experiment (the adjective ‘‘statistical’’ will be im- 
plied from now on), the type of analysis to be performed on the data is a 
major consideration from the start. In some experiments one might wish 
to determine one or several of a larger number of factors which have an 
important effect. In this case the analysis should yield a statement about 
the significance of the effects of the operating factors, with a predeter- 
mined small risk of being wrong. In other cases one looks for quanti- 
tative measures of one or more properties and then the statistician will 
estimate intervals within which, on the basis of the experimental results, 
one can expect with a high probability, the true (unknown) value of these 
measures to lie. 

2. Under the limits set by the requirements in the preceding para- 
graph the design will be such that the experimental effort is minimized. 

3. The design will take into account the adverse effects on the preci- 
sion of the experiment caused by known ambient conditions which are 
not completely under control of the experimenter. 

4. In so far as possible, safeguards against effects from unknown fac- 
tors will be incorporated in the designs. 

The preceding points require that quantitative notions be intro- 
duced as much as possible, not only for the things measured but also for 
the operating factors and disturbances. The experimenter and the stat- 
istician try to agree on a statistical model, describing the expected 
behavior of the physical items in the experiment. Given the model, the 
statistician can suggest experimental arrangements, in an efficient way 
with respect to the experimental effort, which should yield reliable in- 
formation about the problem at hand.. 

In many cases it turns out, when the observations become available, 
that the model has to be modified or that the experiment has not been 
performed according to the design. This usually increases the burden on 
the analysis. It happens occasionally that the data do not show definite 
results, and further experimentation is needed. In that case the careful 
statistical analysis might yield clues in what direction to proceed as well 
as useful quantitative information about disturbing factors, experi- 
mental errors, etc. 
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It has been pointed out that a difference between agricultural and in- 
dustrial experiments lies in the time factor involved. Extension or repeti- 
tion of agricultural experiments is in most cases only possible at yearly 
intervals. In industry the time schedule is much less restricted. There- 
fore it pays to use involved designs in agriculture even at the cost of com- 
plex analyses. Where it is comparatively easy to start a new or partly new 
experiment, complexity may be too high a price to pay. Moreover when 
experimentation goes on parallel to a production process, speed in obtain- 
ing the results of an experiment is of prime importance. Simplicity of 
design is also valuable when the underlying model is not yet well under- 
stood, as in the early stages of exploratory development. 

In the early stages of the manufacture of a complex component, the 
actual specification has to be written on the basis of the results on a com- 
paratively small number of samples. It can hardly be expected that 
these samples are fully representative of the production items which 
will be manufactured. Nevertheless the design engineer will have to 
determine workable limits to give the manufacturer the opportunity to 
get his production rolling without producing too many items not accept- 
able for use. In the L3 system, studies of the over-all requirements of 
- the system had indicated in which way they had to ke broken down into 
the requirements for the components and subassemblies in order to as- 
sure satisfactory operation. In the case of the transformer under discus- 
sion the electrical transmission requirements were more or less fixed. It 
was the task of the design engineer to translate these requirements into 
mechanical tolerances which could be controlled during manufacture. 
On the basis of the equivalent diagram (Fig. 1) for the transformer, ex- 
tensive calculations had been made to determine the relation between 
the variations of the electrical parameters and the over-all transmission 
response.®: §& 1° Hach of the electrical parameters as shown in Fig. 1, a 
simplified picture of the equivalent diagram, does not necessarily cor- 
respond to a discrete part of the physical transformer, but the diagram 
can be considered to represent a model, which lends itself to mathemati- 
cal treatment. Mathematical considerations, statistical or otherwise, on 
the basis of the model, help to establish the mechanical requirements 
for the manufacture, as will be shown later. 

A few of the experiments performed to quantify the underlying rela- 
tionships will be presented in a logical order. Although, through the pres- 
sure of circumstances, the actual experiments did not proceed in a 
strictly orderly fashion, the general line of experimentation was that 
described in this article. Production was progressing in parallel with 
this experimental program and, as described elsewhere,” control charts 
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showed several assignable causes of variation in the parameters, which 
were removed by improvements in manufacturing techniques. 

The experiments selected to illustrate the development program will be 
discussed in some detail. In terms of their most important results these 
experiments can be described as follows: 

1. Pinpointing the input and output network (Fig. 2) as the major 
source of variation. The transformer (Fig. 3) is the main component in 
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Fig. 1 — Coupling networks circuits. (a) Physical elements. (b) On ground 
equivalent circuit, adequate for gain and feedback computations in an amplifier 
configuration employing ground coupling networks. 
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Fig. 4— Input (sub) amplifier block diagram subdivision for hyper-graeco- 
latin square experiment. é 
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Fig. 5 — Exploded schematic of the 2504A transformer. 
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these networks so that subsequent experimentation was concentrated 
on the transformer. 

2. Determining the required manufacturing limits for the wall thick- 
ness of the outer winding form of the transformer, (Fig. 5, 6). 

3. Determining the required manufacturing limits for the “cutback” 
of the shield under the outer winding of the transformer. (The term 
“cutback” will be explained later.) 

4. Comparing the over-all measured response of the complete amplifier 
with its predicted performance as based on a detailed knowledge of the 
components obtained from the designed experiments. 


II. FINDING THE NETWORK CAUSING MOST OF THE UNWANTED VARIATIONS 


From the first series of amplifiers manufactured, it appeared that the 
differences between the measured transmission gain curves for the various 
amplifiers were larger than could be tolerated. 

For this discussion it is sufficient to represent the amplifier as in Fig. 2. 
The blocks represent subassemblies which are mechanically designed 
so that a high degree of reproducibility in the location of the components 
and the connected wiring is achieved. It is therefore feasible to inquire if 
one or two of the subassemblies are responsible for the bulk of the varia- 
bility in measured gain. It is worth noting that the “large” variations 
are not large when compared to the capabilities of the measuring equip- 
ment. The over-all admissible amplifier gain variations are in the order of 
0.2 to 0.3 db corresponding to voltage variations of less than 3 per cent. 






FIRED SILVER_ 
OUTER SHIELD ~~~ 


Fig. 6 — Outer winding form and detail to show ‘‘wall thickness.”’ 
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Consequently, to be able to discriminate between the contributions of 
the individual components one must be able to measure reliably to as 
close as, say, 0.01 db, i.e., to detect voltage variations in the order of 0.1 
per cent. This approaches the presently attainable precision of these 
types of measurements. Finally, these subassemblies are fairly expensive 
and were not in plentiful supply at the time these experiments had 
to be run. 

Practically, it was reasonable to treat the input and output amplifiers, 
as indicated in Fig. 2, as separate entities. Each of these two subamplifiers 
can be measured accurately for its transmission gain in the same way 
as can be done with the completed amplifier. In this fashion a direct 
relationship exists between the results of sub- and complete amplifiers. 
This favorable condition does not exist with respect to the relationship 
between sub-amplifiers and its subassemblies which are also indicated 
in Fig. 2. To determine if the subassemblies meet the over-all require- 
ments, it is necessary to combine them into sub-amplifiers and measure 
those. 

Input and output amplifiers consist basically of the same subassem- 
blies. The type of designed experiment used for both sub-amplifiers was 
identical so that a detailed example for the input-amplifier tells the main 
story. It was felt from engineering considerations that interactions be- 
tween the various subassemblies in an input or output amplifier would 
be of a considerably smaller magnitude than the variations of interest 
and therefore could be neglected. 

Four types of subassemblies make up a sub-amplifier, so these four 
should enter as factors in our experiment. As was pointed out above, a 
set of subassemblies has to be assembled into an amplifier to make 
transmission measurements possible. To evaluate this procedure, every 
time the set of available subassemblies was combined into sub-amplifiers 
it was considered a run. This gives the following factors to be used in 
the experiment: 

Runs 
Coupling Networks 
Interstage Networks 
Beta Networks 
Chassis 
The number of levels for each of the factors is determined below. 

The experimental design should incorporate five factors and minimize 
the number of required subassembly units; however it does not have to 
measure interactions. An experimental design that lends itself to this 
type of situation is a hyper graeco-latin square.’ 

Assigning, as is shown in Table I, the rows to the different runs and 
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Taste I. — Hyper Grarco-Latin SQUARE Layout 























Chassis 
Run 
No. 
1 2 3 4 5 6 7 

1 Ala B28 C3y D498 Kde F6¢ G7n 
2 D3 E4y Fd? G6e A7¢e Bly C2a 
3 G5by A6d. B7Ze Clg D2n E38a F 48g 
4 C70 Die 2¢ F3n G4a Ads B6y 
5 F2e G3¢ AAn Bda C68 D7y Elev 
6 B4¢ C5n D6a E78 Fly G20 Ade 
7 E6n Fla G1g A2y B3e C4e Db5¢ 








Latin letters—Coupling Networks 
Greek letters—Beta Networks 
Numerals—Interstage Networks 


the columns to the different chassis, we can allocate the coupling net- 
works, identified by latin letters, so that each occurs exactly once in 
each column and row. This results in a latin square. If we add to this 
. structure two more arrays, one composed of greek letters, identifying the 
beta networks and one composed of numbers identifying the interstage 
networks, such that each letter or number occurs only once with each 
other symbol we have an (incomplete) system of ‘orthogonal squares’’. 
Data from such a pattern will allow us to obtain unbiased estimates of 
the main effects of the five factors incorporated, in the absence of inter- 
actions. Moreover, the estimates for one factor will be statistically un- 
correlated with those for other factors. 

The square in Table I is of size 7 X 7. This is the smallest practical 
size that could be applied. For 5 factors a square of size 5 X 5 could in 
theory be used as four different orthogonal squares of this size exist,’ 
but we would have only four degrees of freedom to estimate our error. 

No orthogonal squares of size 6 X 6 exist. Ina 7 X 7 we have 49 ob- 
servations and 18 degrees of freedom for error. For this experiment 7 
units of each type had to be assembled 7 times into a set of 7 amplifiers 
each. The first set of 7 amplifiers was numbered | to 7 in random order, 
thus at the same time identifying the subassemblies. The complete lay- 
out of the experiment is given in Table I. 

Measurements on the completed input amplifiers were made at the 
highest frequency of interest in the transmission band, 8.3 mc, and are 
listed in Table II. The analysis of variance computed in the usual manner 
from these data is presented in Table III. Apparent measurement stand- 
ard deviation ¢ = +/0.000254 = 0.016 db. 

It is evident from the sums of squares column in the latter table that 
the coupling networks contribute a very sizeable part of the total varia- 
tion. The experimental error as estimated from the residual mean 
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TaBLE IJ. — TRANSMISSION MEASUREMENTS AT 8.3 Mc IN DB 








Chassis No. 
Run No. 
1 2 3 4 5 6 7 
1 4.739 | 4.799 | 4.935 | 4.713 | 4.824 | 4.998 | 4.870 
2 4.759 | 4.841 | 5.044 | 4.820 | 4.870 | 4.852 | 4.896 
3 4.819 | 4.749 | 4.878 | 4.933 | 4.719 | 4.873 | 4.986 
4 5.003 | 4.749 | 4.866 | 5.001 | 4.797 | 4.761 | 4.836 
5 4.978 | 4.824 | 4.722 | 4.820 | 4.945 | 4.797 | 4.898 
6 4.804 | 4.910 } 4.774 | 4.916 } 5.013 | 4.819 | 4.714 
7 4.897 | 5.056 | 4.861 | 4.701 | 4.827 | 4.913 | 4.748 
TaBLE III.— ANALYSIS OF VARIANCE 
Source D/F |Sums of Squares} Mean Square Significance Level 
Coupling 
Networks 6 0.376359 0.062726 51% 
Interstage 
es 6 0.037422 0.006237 51% 
eta : 
Networks| 6 0.003410 0.000568 
Chassis 6 0.003075 0.000512 not significant at 5% level 
Runs 6 0.003381 0.000564 


Residual 18 0.004634 0.000254 


Total 48 0.428281 


squares amounts to 0.016 db. This disregards the effect of reassembling, 
as indicated by runs, which, however, is not significant at the 5 per cent 
level. It would be possible to pool the run, sum of squares, with that for 
error as estimated from the residual mean square to get more degrees 
of freedom for error but no new insight would be gained by this proce- 
dure. In the type of investigations described a level of significance of 5 
per cent or smaller is generally applied. This implies that the chances 
are 5 per cent or less that, on the basis of the analysis, effects would be 
singled out for further engineering consideration when actually these 
effects are nonexistent. 

To further illustrate the engineering implications, the results of Table 
III can be written in terms of the projected model for this experiment. 
It was assumed that the effects of the members of each of the subassem- 
blies on the amplifier gain were normally distributed. The average value 
of the amplifier gain can be interpreted as the performance of an amplifier 
consisting of subassemblies of exact nominal values. The interesting 
part, however, is the gain variation from amplifier to amplifier, caused 
by the deviations from nominal of the subassemblies. These deviations 
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TaBLE IV.— STANDARD DEVIATION ESTIMATES FOR THE VARIATIONS 
DvE To THE DIFFERENT NETWORKS 





Coupling Networks 0.094 db 
Interstage Networks 0.029 db 
Beta Networks 0.007 db 
Chassis 0.006 db 
Runs 0.007 db 





TaBLeE V. — APPROXIMATE 90 PER CENT CONFIDENCE LIMITS FOR THE 
VARIATIONS Dur To THE DIFFERENT NETWORKS 

















Lower Limit Upper Limit 
(db) (db) 
Coupling Networks 0.065 0.181 
Interstage Networks 0.019 0.056 
Beta Networks 0.0 0.016 
Chassis 0.0 0.015 
Runs 0.0 0.016 








can be measured by the standard deviation of their respective distribu- 
tions. These standard deviations as derived from Table III are listed in 
Table IV and their approximate 90 per cent confidence limits in Table V.9 

It appears again that the coupling networks contribute most to the 
variations in the transmission of the subamplifier. The interstage net- 
works are of secondary importance, whereas the other three factors 
can be neglected. A similar picture emerged from the companion ex- 
periments on the output amplifier. It was therefore logical to concentrate 
first on trying to decrease the variability of the coupling network of 
which the transformer was the main part. 


III. WALL-THICKNESS STUDIES ON THE OUTER COIL FORM OF THE TRANS- 
FORMER 


The transformer, even in its simplified form as in the equivalent cir- 
cuit of Fig. 1, involves many parameters. By numerical evaluation the 
changes in transmission gain due to specified changes in these parameters 
were calculated on the basis of this circuit.5® As has already been 
pointed out, not all of the parameters in the equivalent diagram are 
directly represented in the physical transformer; therefore a relationship 
between the parameters and physical dimensions is not easy to establish. 

From evaluation of the electrical circuit it was felt that the capaci- 
tance at the high inductance side of the transformer, C; in Fig. 1, would 
be a major contributor to the gain variation. Direct correlation between 
the behavior of this capacitance and various mechanical properties on 
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the basis of control charts did not yield sufficiently strong clues, partly 
due to the fact that the measurement accuracy in the production process 
was marginal in view of the small variations concerned. On the basis of 
engineering experience one of the strongly suspected mechanical variables 
was the wall thickness of the outer coil form of the transformer. The 
exploded views in Fig. 5 and Fig. 6 show that the outer form carries the 
winding with the highest number of turns. These turns are ground into 
the vycor glass body and they are subsequently copper plated. A silver 
shield is sprayed on the inside of the vycor glass form and fired subse- 
quently. The “‘thickness”’ of the wall as measured between the bottom of 
the groove and the inner face is about 0.031” and the geometry of the 
situation leads us to expect a strong dependence of the high side capacity 
on the wall thickness. (Fig. 6.) 

The experiment to estimate the quantitative influence of wall thickness 
variations on electrical properties was set up as follows: 

Two batches of 9 transformers each were produced in accordance with 
current production specifications except that batch ‘‘A” contained outer 
coil forms with “thick” walls and batch “B” with “thin” walls. On a 
nominal thickness of about 0.031” batch A was on the average about 6 
ten thousandths thicker than batch B. Due to the difficult’ grinding 
process it was impossible to make all coil forms of the same batch exactly 
alike to the limit of measurement, i.e., to within half a ten thousandth. 
The resulting variation in this thickness within a batch is indicated by 
the standard deviation of 1.5  10-*. 

All these transformers were measured in the same standard amplifier 
and the gain was observed at a number of frequencies. In addition, 
various short-circuit and open-circuit impedances were determined on 
the isolated transformers. Since these impedances bear a direct relation 
to the magnitude of the parameters in the equivalent diagram, one ob- 
tains information about the variations in the parameter values from the 
observed variations in the impedances. Allowing for these variations in 
predicting the performance of the circuit on the basis of the equivalent 
diagram, it is possible to compare the observed gain with that pre- 
dicted. An example of such a comparison will be discussed later. 

After a complete first run of measurements had been made on the 
transformers as manufactured, a second run was performed after the 
thick walled and thin walled coil forms had been interchanged between 
the transformers of batch “A” and ‘B”’. 

Identifying the transformers without a coil form by capital letters and 
the forms by lower case ones in accordance with the batch to which they 
originally belonged, the actual set-up is given in Table VI. This table 
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TasLE VI.— Basic DESIGN FoR WALL THICKNESS 
DEPENDENCY DETERMINATION 














Transformer Batch 
Coil Form 
A B 
a Run 1 Run 2 
b Run 2 Run 1 





represents the experiment only “batchwise”’. It is important to note 
with respect to the model given below, that the interchange of one pair 
of coil forms (one thick and one thin) did not in general take place within 
one pair of transformers (one from batch A and one from batch B). If 
this had been done, a different analysis could have been performed on 
the same amount of data. 

The mathematical model underlying this design takes into account the 
following effects: 


Bh = average level g= 1,2. 

B; = batch a=1,2,---,9 
¢:i,; = transformer 7 in batch 7 7 = 1,2: 

w, = wall thickness k = 1, 2. 

pi = runs l= 1,2. 


€:;, j,%, 1 = residual, being the difference between the measurements of 
the 2** transformer in the jt* batch and its prediction from 
wall thickness, batch and run effect. 


With these definitions the observations y;, ;,%, 1 can be expressed as 
follows: 


Ui gk BS ee Bp ge er ee ee FH ek Ps 


From Table VI it is apparent that the wall thickness is measured by 
the row differences, the batch effect by the column differences and the 
run effect by the diagonal differences. The latter is indistinguishable 
from the row by column interaction, but there were reasons to believe 
that the interactions were of a smaller order of magnitude than the run 
effect. ; 

The results of the gain measurements at one of the frequencies em- 
ployed, 8.3 me, are presented in Table VII, which gives only the frac- 
tional db, expressed in thousandths of db. A constant whole number of 
db is omitted throughout. This incorporates the fixed gains and attenua- 
tions of the measuring set up. 
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TasuLe VIT. — Gain MEASUREMENTS AT 8.3 Mc. Errecr or DIFFERENT 
Wau THICKNESS oF OuTER Form 




















Batch A Batch B 
os rans- | x 0,001 db Frans 1's¢ 0000i/db 
1 744 1 531 
2 778 2 510 
3 723 3 437 
4 698 4 487 
Run 1 ‘Thick?’ Wall 5 738 “Thin”? Wall 5 447 
6 644 6 608 
7 711 7 562 
8 670 8 476 
9 604 9 470 
1 645 1 674 
2 582 2 700 
3 556 3 634 
4 577 4 711 
Run 2 “Thin” Wall 5 582 “Thick”? Wall 5 512 
6 524 6 725 
7 550 7 658 
8 483 8 680 
9 547 9 676 





Taste VIII. — ANALYSIS OF VARIANCE OF WALL 
THICKNESS EXPERIMENT 











D 
Source See of Free: gate Significance Level 
Between batches 20 449 1 20 449 5%t 
Between transformers, 75 126 16 4 695 1% 
. within batches 
Between runs 880 1 880 a eens at 5% 
eve 

Between wall thickness 203 401 1 203 401 <1% 
Within transformers cor- 20 888 16 1 305 

rected for runs and wall 

thickness (error) 

Total — 320 744 | 35 








The analysis of variance of these data is presented in Table VIII. 

It is readily seen from Table VIII that the wall thickness accounts for 
most of the variations, and that the effect of runs is indistinguishable 
from the error. It is possible just as was done in Table V to calculate the 
variance components for these effects and its limits. Both however are 
only based on one degree of freedom which makes this procedure hardly 
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profitable. The batch effect is tested against the “transformer within 
batches” variation, and the level of significance is a little over 5 per cent. 
This indicates that there was a systematic difference between the two 
batches. 

An estimate of residual variation can be obtained from the two obser- 
vations on the same transformer corrected for the estimated differences 
due to wall thickness and run effects. The standard deviation for error 
is ¢ = +~/1305 = 36 or 0.036 db in actual units. This can be compared 
to the stated goal of 0.01 db and the result of the preceding experiment 
0.016 db. The two averages computed for the different wall-thickness 
groups, y..z, provides us with an estimate of the effect of the average 
change in wall thickness on the gain: . 

For the ‘“‘thick” wall the estimated gain is 0.682 db. 

For the ‘‘thin” wall the estimated gain is 0.532 db. 

Average increase of 0.006” in wall thickness results in an increase of 
0.150 db at 8.8 me. In order to find out if the experiment was sensitive 
enough to find the dependence on wall thickness of the transmission 
measurements of the individual transformers, the residuals, as calculated 
from the equation on page 35, are plotted against the measured wall 
thickness, Fig. 7. The measurements of the wall thickness could be read 
to the nearest 0.00005”, but as seen in Fig. 7, the variations are too great 
to show any significant correlation with the fine structure of the wall 
thickness. 





RESIDUAL x 0.001 DECIBELS 








WALL THICKNESS DEVIATION IN MILS 


Fig. 7 — Residual variations, after the systematic effects have been removed, 
as a function of the wall thickness variation. 
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This experiment showed that it was necessary to control the wall 
thickness as closely as would be cconomical. The practical limit was 
known and the resulting transmission variations as estimated from the 
findings in this experiment, would be satisfactory from. the over-all 
systems point of view. 


IV. STUDY OF SHIELDING AND WINDING TERMINATION 


Another mechanical variable to be considered is related to the ter- 
mination of the winding on the outer form. One side of the winding (ter- 
minal No. 4) is connected to the shield that covers the inside of the 
coil form (Fig. 8). The other end has to be connected to one of the ter- 
minals (No. 5) on the body of the transformer. Electrically this latter 
point is sensitive and should be shielded as much as possible. On the 
other hand, in order to be able to connect the terminal lead to the wind- 
ing a tab is inserted on the form. The shield must be cut back sufficiently 
to avoid short circuiting the winding via the tab. Originally a 0.150” 
cutback was employed. Mechanical limitations make variations around 
the nominal cutback value unavoidable. The following experiment was 
set up to find out which nominal cutback value would result in the small- 
est variations in the transmission gain of the transformer. 
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Fig. 8 — Side view of outer cylindrical spool, as per Fig. 6. 
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Another variable had been introduced into the problem inadvertently 
in the manufacturing process. This variable was related to the same sensi- 
tive point of the winding, and consisted of the amount of run-out or extra 
winding cut by the grinder beyond the point where the terminal tab 
No. 5 was connected to the winding. The run-out is measured in degrees 
of arc. Originally the run-out was kept close to 28°. After some manu- 
facturing changes required for other reasons, the run-out variations be- 
came much larger. It was thought important to examine cutback and 
run-out at the same time to find any interaction effects if present. 

An experiment to determine effects of cutback and run-out faces a 
difficulty similar to the previous one. The only hope to detect these 
effects is to try out the same transformer with different cutback and 
run-out values. This implies disassembling and re-assembling the trans- 
formers as many times as changes in the variables are made. In addition 
the change in variables can only go in one direction: the cutback can be 
increased by taking away a little bit of the shield and the run-out can 
be decreased by removing part of the run-out winding. 

In accordance with these conditions an experiment was designed as 
indicated in the flow chart of Fig. 9, covering the possible combinations 
of applied changes in cutback and run-out in a systematic manner. 

The cutback value of 0.150” and the 28° run-out were the standard 
values in the manufacture at the time of the experiment. The stages of 
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Fig. 9 — Flow chart of applied changes. 
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reassembly are indicated in order. The starting point for each trans- 
former was 0.100” cutback and 28° run-out. 

This is an example of an experiment where several mishaps distorted 
the original design — a not unusual occurrence. Due to the time and 
costs involved the experiment was not repeated but a special effort was 
made to recover the information sought. 

As in the previous experiment the transformers were measured in an 
amplifier to determine the gain characteristic as a function of frequency. 
In addition a few characteristic parameters were measured on the trans- 
former itself. 





Vig. 10 — Jig for transformer measurement. 
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TABLE IX. — GAIN MEASUREMENTS At 7.8 Mc IN THOUSANDTHS OF DB 

















- Cut-Back X 0.001” 
T nen ounes Run-Out 
100 120 150 180 195 
1 0 = 203 226 = 377 
28 = = = a = 
2 : 154 166 216 _ 360 
2 = = ss — — 
3 0 = — 242 = 344 
28 — 216 240 — — 
4 0 = = == = 351 
28 _ 193 264 340 — 
5) 0 243 184 227 — 333 
28 _ — a — _ 
6 0 — — — = 377 
28 — 184 242 324 — 


At the second stage of the experiment, Fig. 9, it appeared that the 
precision of measurement was rather poor due to the differences occur- 
ring when the transformer was disconnected from the amplifier and after 
the change in cutback and/or run-out reconnected by means of soldering. 
It was therefore decided to construct a contact fixture allowing the 
transformer to be plugged in and out of the amplifier. 

For the first time after the fixture shown in Fig. 10 became available 
the transformers were measured twice — once soldered into the ampli- 
fier and once plugged in. This was done after the second reassembly and 
the previous measurements were adjusted to the fixture readings on the 
basis of this comparison. Almost all of the initial measurements (State 0) 
had to be discarded. 

An additional deviation from the design occurred in the final stage 
when some of the transformers were cut back too far, to 0.195” instead 
of 0.180”. ; 

As an example the gain measurements at 7.3 me are listed in Table IX. 

When considering results such as in Table IX for further analysis the 
question arises what type of model should be fitted to the data. It goes 
without saying that apart from fitting the data the choice of the model 
must primarily make sense from an engineering standpoint. For designs 
like the hyper graeco-latin square of Section II and balanced designs in 
general the computational part of the analysis is small, measured in 
man-hours on a desk calculator. Changing the model in those designs 
by incorporating more factors or discarding alleged superfluous ones is 
simple, as the estimates of the effects of these factors in balanced situa- 
tions are independent of the others. 

In a case like in Table [IX where no reasonable balance is left but where 
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the operating factors (cutback, etc.) are measurable or quasi measurable, 
regression models are indicated. The computational effort on a desk 
calculator to estimate the parameters in the regression model is consider- 
able for three operating factors, as in our case. To explore a sufficient set 
of modifications of a model for four or more factors is only practical if 
an automatic computer is available. 

As a first step in the analysis a linear multiple regression equation on 
three variables was calculated, the independent variables being: 


x, : number of resolderings 
2 : run-out 
23 : cut-back. 
The model fitted was: 
Y — 9 = Bilt — 41) + Bolte — %2) + Bas — 4s). 
Estimates b of the 6’s resulted in 


b, = —0.023 db/step 
be = —0.0028 db/degree 
bs = 0.0052 db/mil. 


The corresponding analysis of variance table is Table X. Having a set 
of numbers it is always possible to go through the calculations and obtain 
estimates of the 6’s. The important part, however, is to determine how 
well the model fits. Looking at the analysis of variance Table X it appears 
in this case that a substantial part of the total observed variation as 
measured by the total sum of squares is explained by the model. The 
variations taken care of by the model are accounted for by the sum of 
squares for regression. The remainder measures our error. The esti- 
mated ¢ from the residual is ~/0.000630 = 0.025 db. 


TABLE X,— ANALYSIS OF VARIANCE FOR LINEAR REGRESSION 
ON 3 VARIABLES 























Source SS D/F MS 
Regression 0. 102845 3 0.034282 
Residual 0.012596 20 0.000630 

Total 0.115441 23 
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TABLE XI.— ANALYSIS OF VARIANCE FOR LINEAR 
REGRESSION ON X) AND X3 


























Source SS D/F MS 
Regression x3 alone 0.096059 1 0.096059 
Improvement due to x added 0.002085 1 - 0.002085 
0.098144 2 
Residual 0.017297 21 0.000824 
0.115441 











Estimated ¢ = 0.029. 


It is of importance to find out the magnitude of the contribution by 
the individual independent variables x; to our model. The general way 
of doing this is to drop one or more of the independent variables, recom- 
pute the estimates for the regression coefficients for the remaining vari- 
ables and study the result in a new analysis of variance table. 

As an example consider the simplified model 


Y — 9 = 63(x3 — 2s) 


and ask for the importance of incorporating the reassembly variate 2; 
into this model. We can list the results as in Table XI. The improve- 
ment due to the addition of 2; is not significant at the 5 per cent level. 

Fig. 11 illustrates this procedure for a number of possibilities. What- 
ever model for fitting is chosen the total sum of squares is the same. The 
horizontal line at the top of the picture corresponds to this value of 
0.115441 (db)? (Table X). The length of the bars shows the part that is 
explained by incorporating in the model the variables listed at the bot- 
tom of each bar. 

The run-out 2x2 by itself does not appear to contribute anything ap- 
preciable, although in combination with resoldering 2, it shows up a 
little. Cutback x3 alone accounts for the bulk of the variation. Resolder- 
ing x; also shows up alone, but once 2; is incorporated, addition of 2 is 
not too important. This behaviour corresponds to the very strong 
correlation (correlation coefficient = 0.93) between the independent vari- 
ates x, and x3. This correlation stems from the fact that an increase in 
cutback necessarily corresponds to a later resoldering. 

Engineering considerations suggested that the amount of non-linearity 
due to the cutback variable x3; should also be examined. Cutbacks 
smaller than about 0.150” do not reach under the first turn of the wind- 
ing (Fig. 8) so they do not influence the shielding operation as strongly 
as when the cutback exceeds 0.150”. 
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TOTAL SUMS OF SQUARES OF VARIATIONS 
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Fig. 11 — Contributions of various factors to the sums of squares of regression. 


Introducing a quadratic term in the model 
(Y — 9) = Bilas — &s) + Bulas? — 2°) 


gives the best fit to date as shown in Fig. 11. Run-out and resoldering 
are now left out, the former making no significant contribution and the 
latter being sufficiently taken care of by its correlation with the cutback. 
After all the resoldering was only of interest in the experimental situa- 
tion, and did not occur in actual production. 

Estimating the parameters yields 


Y — 0.470 = 0.005 x3 + 0.000023 2? db 


when 23 is the cutback in 0.001”. The residual error standard deviation 
¢ = 0.023 db. Predicting some values 


Cutback Gain 
0.120” 0.201 db 
0.150” 0.238 db 
0.180” 0.315 db 


shows that 0.030” less cutback with respect to 0.150” makes a difference 
of about 0.04 db, whereas 0.030” increase changes the gain by almost 
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0.08 db. Since gain should be insensitive to the variations in cutback 
which occur in manufacture, it was decided to keep the nominal cutback 
value at 0.120”. 

In the analysis of each of the above experiments only one set of meas- 
urement results has been discussed. With the particular type of measur- 
ing set used, the gain of the amplifier is obtained as a continuous curve 
over the whole frequency range of interest. At about ten different fre- 
quencies ranging from 0.3 to 8.5 mc the results have been analyzed in 
the way described. In addition several discrete impedances in the trans- 
former closely related to the elements in the equivalent diagram, Fig. 1, 
were measured directly. 

In such a situation a very important check can be made about the as- 
sumptions underlying the experimentation and the analytical approach. 
On the one hand, we have the measurements of the performance of the 
transformer in the circuit and the measurements of various impedances 
connected with leakage, stray capacitances, etc. of the transformer. On 
the other hand, we have the analytical study of the model in the form 
_ of the equivalent diagram, Fig. 1, which provides us with a prediction of 
the over-all performance from the values of these impedances. If this 
prediction is sufficiently close to the measured over-all performance we 
can use control of the impedances to control the performance. In addi- 
tion we can use the model for studying the consequences of contemplated 
major changes in the design. 

From the point of view of guaranteeing reliability of complex systems 
it seems to be essential that a model as close to reality as possible be em- 
ployed for prediction. 

Comparisons between prediction from the equivalent diagram, Fig. 1, 
and measured curves have been made for the different experiments in the 
development program. Fig. 12 presents such a comparison for the pre- 
viously described ‘‘wall-thickness’”’ experiment. The changes in imped- 
ances observed corresponding to a change in wall thickness of 0.0006” 
were fed into the formulas derived®: ° for the equivalent diagram. The 
resulting predicted gain values, together with the measured gain values, 
are plotted as a function of frequency in Fig. 12. Remembering the order 
of magnitude of the estimates for the error standard deviation, a few 
hundredths of a db in this type of transmission measurements, the agree- 
ment is satisfactory. 


V. FINAL EVALUATION OF THE TRANSFORMER 


The results of experiments like the ones described contributed to the 
tying down of specifications and controls in the manufacture of the 
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fe=—“ PREDICTION ON BASIS OF 
TRANSFORMER PARAMETERS 


O=——O MEASURED IN AMPLIFIER 


TRANSMISSION IN DECIBELS 





FREQUENCY IN MEGACYCLES PER SECOND 


Fig. 12 — Comparison between measured and predicted transmission for a 0.6 
mil increase in wall thickness. 


transformer. As the measures derived from each of the experiments re- 
lated only to a detail of the transformer, it was considered necessary to 
set up an experiment incorporating the results of the various tests, in 
order to examine the over-all performance of the transformer, in a 
complete amplifier. 

In other words, it would be useful to confirm that the gain variations 
in the amplifier dependent on the (uncontrollable statistical) variations 
in the electrical parameters of the transformer are small enough to 
satisfy the systems designer. The experimental scheme adopted for this 
purpose called for a fair sized number of transformers basically belong- 
ing to two groups: 

a. One group of transformers conforming to the current specifications 
and of recent manufacture at the time of this experiment. 

b. One group of transformers consisting of recent rejects and all other 
old transformers that could be found, all having one or more parameters 
outside the specifications. 

These transformers would be very carefully measured in the Labora- 
tories, taking special care and using the best measuring equipment 
available. (The previous experiments described in this paper had been 
conducted in Western Electric factories.) 

From the measured values of the parameters such as leakage induct- 
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ance, stray capacitances, etc., the predicted gain would be computed, 
again using the formulas derived on the basis of the equivalent diagram. 
The computed gains would finally be compared to the measured ones. 
It was hoped that this experiment would show two things: 

1. That recently produced transformers which showed satisfactory 
parameter-measurement results would yield good amplifiers. 

2. That the parameters chosen for control measurements in the trans- 
former manufacturing process were adequate to reasonably predict the 
over-all transmission performance in the amplifier. 

The experiment was preceded by a pilot experiment to test the gain- 
measuring equipment. In both steps of experimentation two jigs for gain 
measurements were to be used, consisting of almost identical sub-ampli- 
fiers, and measurements at 15 frequencies between 0.3 and 8.5 me were 
to be made. The pilot experiment was designed such that an estimate of 
the jig differences and of the influence of time could be made. In addi- 
tion the magnitude of residual error could be determined. 

Eight transformers were measured twice in each of the two jigs in the 
following sequence. (Table XII.) 

As an example let us again choose the results at a high frequency, as 
the sensitivity of the transformer and amplifiers for small deviations 
from the ideal increasés with frequency. 

The time effect will be judged by the difference between the first and 
second half of the experiments, called H; and He respectively. 

Disregarding the time sequence in each half, which can always be 
recovered if so desired by examining the residuals, the results coded as 
before in thousandths of db are given in Table XIII. The analysis of 
variance is given in Table XIV. Using the three-way interaction as a 
measure of residual variation Table XIV shows that the transformer by 
time and the jig by time interactions are unimportant. The transformer 
by jig interaction although not significant at the 5 per cent level is dis- 
turbing in an experiment of this kind. This might indicate that contact 
trouble exists between the transformer and the jig. The transformers 
were not soldered in the jigs but contact was made by means of springs. 


TaBLE XII. — TRANSFORMER NumBers IN TIME SEQUENCE OF 
MEASUREMENT FROM Lert To RIGHT 





mM He 
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TABLE XIII. — Proot Expertments 8.3-Mc Gain MEASUREMENTS 
oN “MicroBeL” Test Ser. Units 0.001 pp 

















Jig 1 Jig 2 
My Az Ay He 
Tr. 2 765 777 890 888 
2 652 672 797 777 
3 812 814 920 910 
4 760 747 915 927 
5 832 840 961 938 
6 775 743 909 887 
7 756 757 889 878 
8 698 705 832 820 
Average for Jig 1 756 Average for Jig 2 884 


TABLE XIV.— ANALYSIS OF VARIANCE OF PILOT EXPERIMENT 




















Source SS D/F MS pee 
Between Jigs 129159 1 129159 K1%G 
Between Transformers 79930 7 11419 K1% 
Between Time 256 I 256 >10% 
Transf X Jigs 2684 7 383 =T% 
Jig X Time 229 1 229 >10% 
Transf < Time 602 7 84 >25% 
Transf X Jigs X Time 803 7 115 

Total 213663 31 





In the main experiment following this pilot one, contact trouble arose 
again. Moving up in the table the time effect appears negligible. The sig- 
nificant differences between transformers do not have to be considered 
as this reflects only the differences in their nominal gain, but the jig 
effect is highly significant even with respect to the transformer by jig 
interaction. 

It would have been unrealistic to expect the jigs to be equal because 
of their complexity. What was hoped was that the difference between the 
two would be substantially constant. From the averages listed in Table 
XIII, we estimate the difference between Jig 1 and Jig 2 as 0.128 db, 
with 90 per cent confidence limits of 0.114 to 0.142 db based on standard 
deviation for the average difference of 0.008 db with 15 degrees of free- 
dom. For this latter estimate the jig interactions were pooled with the 
“error’”’ variance. 

If the variations between jigs would remain within the above limits in 
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the main experiment yet to be made, this would be reasonable. However, 
the jig by transformer interaction tells us to be on guard. 

The main experimental design following this pilot study is presented 
in Table XV. The intent was to obtain units with as wide a spread of 
properties as possible. Then, as explained in the beginning of this sec- 
tion, we could see if the formulas which predict the over-all gain from 
the detailed impedances of the transformer would hold over a wide 
enough range. In each period all the transformers listed were measured 
in one jig and then in the other. The jig sequence was varied from 
period to period. Transformers meeting specifications and rejects were 
collectively randomized over serial numbers. Therefore 50 good trans- 
formers of recent production were combined with 33 rejected ones. The 
latter were rejected for a variety of reasons and over a considerable 
period of time. In principle, no special design is necessary to obtain ob- 
servations for comparing detailed measurements of a transformer to the 


TABLE XV.— MEASURING SCHEDULE FOR TRANSFORMERS 
IN TERMS oF THEIR SERIAL NUMBERS 








Runs = Days 1 | 2 | 3 | 4 
Jigs 
1 | 2 1 2 1 2 1 2 
Morning 1 | 10 | 22 | 25* | 48 | 50 | 64 | 72 
2 3 23 26 44 46 65* | 73 
3 2 24* | 27 45 44 66* | 71 
4 1 25* | 28 46 45 67 64 
5 6 25 22 47 49* | 68 66* 
6 5* | 27 29 48* | 47 69 67 
7 4 28 30 49* | 48* | 70 65* 
8 8 29 -| 31 50 51 71 69 
g* 7 30 23 51 52 72 70 
10 9 31 24* | 52 43 73 68 


Afternoon 11 | 20* | 32 | 35* | 53: | 59* | 74 =| 79 
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performance of an amplifier containing the same transformer. But the 
time involved in measuring more than 80 transformers in each of two 
jigs is several days, so the possibility of time effects had to be watched. 
First, the numbers in the design were assigned at random to the pool of 
good and rejected transformers. Second, to keep a running check on the 
precision of the measurements a number of observations were repeated 
on different days (runs). In each pair of adjacent runs, and in the last 
and the first, a set of four transformers was replicated both in Jig 1 and 
Jig 2. From Table XV it can be seen that these linking sets are the 
following: 


Run I and II Transformers 24, 25, 35, 40 
Run II and III Transformers 48, 49, 57, 59 
Run III and IV Transformers 65, 66, 77, 80 
Run IV and I Transformers 5, 9, 14, 20 


As a further precaution, which it was found not necessary to use in the 
analysis, half of the transformers in the sets above were replicated in the 
same period of the day, the other half in different periods. For Runs I 
and II we find from Table XV, in Jig 1, transformers 24 and 40 in the 
same periods, transformers 25 and 35 in different periods, in Jig 2, 
transformers 25 and 35 in the same periods, transformers 24 and 40 in 
different periods. A typical analysis for one linking set disregarding the 
period allocation, is shown in Table XVII for the observations taken at 
8.3 me and listed in Table XVI. 

Both the interactions of jigs and runs and jigs and transformers are 
significant at the 5 per cent level. The run main effects mean square is 
not significant but the interactions with the jigs are disturbing. These 
interactions showed up to a greater or lesser extent in all the compari- 
sons, both in those similar to this one and in the pilot experiment. The 
importance of the jig by run interaction can be illustrated if we list the 


TaBLeE XVI.— Typicau Ser or LINKING MEASUREMENTS INCLUDED 
IN Main Experiment. Units 1n 0.001 ps 




















Run IIt Run IV 
Transformer 
_ Jig 1 Jig 2 Jig 1 Jig 2 
65 4 230 15 195 
66 —40 191 10 195 
77 —65 92 —47 75 
80 —45 152 —18 148 
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TABLE XVII. — ANALYSIS OF VARIANCE. TypicaL LINKING SET IN 
Main EXPERIMENT 








Source Ss D/F MS ignilicance 

Between Jigs 127449 1 127449 K1% 
Between Runs 20 1 20 > 25% 
Between Transformers 22975 3 7658 K1% 
Jigs X Runs 931 1 931 <5% 
Jigs X Transformers 2262 3. 754 <5% 
Transformers X Runs 337 3 112 20% 
Jigs X Runs X Transf. 170 3 57 











TasLe XVIII. — Jig Comparison 





Jig 2 — Jig 1 (in db) 90% Confidence Limits ‘in db * 





Pilot 0.128 | 0.114 to 0.142 
Run I & II 0.114 0.054 to 0.174 
Run II & III 0.149 0.037 to 0.201 
Run III & IV 0.170 0.120 to 0.220 
Run IV & I 0.121 0.040 to 0.201 








average differences between the jigs as observed in the various pairs of 
runs and in the pilot experiment. In Table XVIII are also calculated 90 
per cent confidence limits for the jig difference based on a variance esti- 
mate incorporating the variances for the jig interactions. It was originally 
hoped to use an estimate of difference between the jigs to eliminate the 
jig effect from all the individual observation. The wide confidence limits 
of the jig difference estimates compared to the 0.01 db order of magni- 
tude we are interested in, do not allow us to do this. Therefore the sub- 
sequent analysis was made separately for both jigs. 

In addition to the gain measurements the following impedances were 
observed on all transformers: Resistive and Reactive component of 
leakage (Re and R,); Capacitance over the high winding (Cy); Stray 
Capacitances (C's, and C's,). These impedance results introduced in the 
formulas for the equivalent diagram of the amplifier yield a predicted 
gain, which should represent, if everything is all right, the measured 
gain values. 

Using the coefficients m;, 7 = 1, 2, --- 5, as computed from the 
_ equivalent diagram, we predict the transmission gain to be: 


Y =m + mRe+ mR, + mCu + mCs, + mCs, - 
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Here, mp is an arbitrary constant, not important in these considera- 
tions, as in measuring amplifiers of this type, frequency-independent loss 
networks are often introduced, which add an additional constant in m . 

Calling the measured transmission gain y, we will try to fit the model 


y=act BY. 


The Y is taken as the independent variable as the transformer parame- 
ter measurements are more precise than the transmission measurements. 
In general for this type of regression line fitting the independent variable 
should be known without error. 

If the equivalent diagram is adequate 8 should be equal to 1; our esti- 
mates 6 of 6 therefore should not differ significantly from that value. 
Table XIX lists for 8.3 mc the estimates of the slopes, their standard 
deviations, and the estimated standard deviations of the residual varia- 
tions not accounted for by the regression. The intercept a like the 
parameter mp» in the prediction equation, is of no interest as explained 
above. 

It is seen that the agreement of the slopes with the theoretical value 
1.00 is reasonably good, especially for Jig 2. 

The rejects selected for this experiment fall into two classes, those in 
one set of recent manufacture not meeting the manufacturing specifica- 
tions, but not too far removed from them, and the others left-over from 
the development program. Even for such groups with wide variations in 
their parameters not meeting the end requirements the agreement be- 
tween prediction and measurement is reasonable. The Jig 1 results gen- 


TaBLE XIX.— CoMPARISON BETWEEN THE REGRESSION PARAMETERS 
ESTIMATED FROM THE MEASUREMENTS IN Botu JIGS. 
- FREQUENCY 8.3 Mc 





Sages dora || Sigucaet Gieneh |) Staten eae 
Jig 1 Jig 2 Jig 1 Jig 2 Jig 1 Jig 2 
Standard production 50 1.38 0.97 0.18 0.11 0.04 0.02 


units 
Rejects from production 0.90 Lelk 0.11 0.07 0.18 0.08 
18 units 
Rejects from development | 0.78 0.82 0.18 0.08 0.07 0.03 
15 units 
All 83 units pooled 0.84 1.05 0.06 0.04 0.05 0.02 
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erally show a bigger deviation from the ideal value of 1 for the slope and, 
also, larger residual variations as indicated by the estimates of the vari- 
ance. It will be remembered that from the pilot experiment and the 
“built-in” control in the main experiment it appeared that the differ- — 
ence between Jig 1 and Jig 2 was not constant. Subsequently a poor 
contact in Jig 1 was identified. However, the general result of the ex- 
periment was satisfactory, in that the feasibility of maintaining the over- 
all performance of the amplifier within the required limits by controlling 
the parameters of the transformer was demonstrated. 


VI. CONCLUSION 


The foregoing describes some highlights in the statistical aspects of 
the development program of one of the critical components in the L3 
system. It will be clear that statistics can be a very powerful help, when 
integrated in the engineering efforts. 
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Runs Determined in a Sample by an 


Arbitrary Cut 


By PAUL S. OLMSTEAD 
(Manuscript received August 9, 1957) 


This paper, after making a critical review of the literature pertaining to 
runs above and below in a fixed sample, provides the following extensions: 

1. Sample arrangement distributions for runs of length at least s on one, 
each, and either side of any selected cut for samples of 10 and 20, 

2. Sample arrangement distributions for runs of length at least s on one, 
each, and either side of the median for samples of 10, 20, 40, 60, 100, and 
200, 

3. Sample arrangement distributions for runs of length at least s on each 
side of all possible cuts for samples of 10, 20, 40, and 100, 

4, Asymptotic values of the probabilities of such arrangements when the 
sample size and length of run are large, 

5. Convenient charts and tables for probabilities of 0.01, 0.10, 0.50, 0.90, 
and 0.99 to facilitate use by engineers and scientists, and 

6. Discussion of a simple application. 

The inclusion of the case for runs of length at least s on each side of all 
possible cuts should prove very useful because it provides a quantitative 
measure for a common operational procedure for which the exact proba- 
bilities were heretofore unknown. 


I. SUMMARY 


This paper discusses certain nonparametic measures for use in de- 
tecting the presence of assignable causes in experimental data. Specifi- 
cally, it assumes that a sample of n observations of a characteristic, X, 
has been obtained and that a particular arrangement, X,, Xo, -°--: Xn, 
e.g., by the time order of determination or other considerations, increases 
the value of the sample as evidence. Assuming a cut at a particular value 
of X, such as A, such a series may be divided into groups of consecutive 
observations that lie, alternately, above and below the cut. The length 
of such a group is called a run. 
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The paper also presents charts, tables, and formulas relating to such 
sample arrangement distributions for runs above and below any selected 
and all possible cuts or demarcation values. Specifically, it contains: 

a. A review of the literature relating to runs above and below (Sec- 
tion II). 

b. Appropriate charts and tables for the convenience of the engineer 
or other user (Section II and ITI). 

c. An example (Section III) and reference to others (Section II). 

d. A procedure for obtaining the probability that a randomly selected 
arrangement of a sample of size n will contain one or more runs of length 
at least s on each side of at least one of all posszble cuts or demarcation 
values that do not coincide with one of the numerical values in the 
sample (Section VI). 

e. Relationships between n and s for constant probability (Section 
VIII). 

f. The probability that a randomly selected arrangement of a sample 
of size n will contain one or more runs of length at least s on each side 
of a selected cut or demarcation value such that n; numerical values are 
above and ne numerical values are below (n = m + 72). Similar prob- 
abilities are given for arrangements with runs above, with runs below 
and with runs on either side of such a cut or demarcation value (Section 
IV). 

g. Simplified formulas for runs above and below the median that are 
equivalent to those given by Mosteller* (Section V). 

h. Asymptotic values of these probabilities for both n and s large 
(Section VII). 


II, HISTORICAL BACKGROUND AND DISCUSSION 


Runs above and below the average, the median, or some other selected 
value have been used by a number of engineers to assist in detecting and 
identifying assignable causes of variation in connection with research 
and development work. In order to have a clear picture of the problems 
of such work, it may be worthwhile to set down some statements which 
characterize it: 

a. A repetitive process that has not been examined for control by 
statistical methods and that has not subsequently been brought into 
control is very unlikely to be in statistical control, 

h. Causes of lack of control often occur ‘sporadically, being present for 
relatively short intervals of time, 

c. Such causes of lack of control may often be detected by taking ac- 
count of order either in manufacture or in taking observations, and 
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d. A basis for determining what fractions or portions of the observa- 
tions may have been affected by an undesired cause is the application of 
statistical tests to the pattern of the individual values of the measure- 
ments in the order in which they were obtained. 

Runs above and below have been particularly useful in assisting in 
the identification of such assignable causes. Their use in engineering has 
progressed through the following steps: 

1. Using a procedure based on the work of Cochran,’ Shewhart’ showed 
the distribution with respect to length of the runs above and below the 
average. It was his observation that a run of length 7 was often asso- 
ciated with a cause that could be found. Cochran had derived the dis- 
tribution of runs of lengths s (our notation) of twocomplementary events 
E, and E, of known probability, p, and gq = 1 — p, respectively. In 
applying Cochran’s formula, Shewhart chose tivo statistics, X and p, 
from his observed data. Recognizing that this might invalidate the use 
of Cochran’s formula, he suggested to the writer that this loophole could 
be avoided by working out the distribution for run lengths relative to 
the median. This distribution was worked out and recorded in a mem- 
orandum dated October 14, 1940. 

2. About the same time, Mood’ was working on his “Distribution 
Theory of Runs” for which the distribution relative to the median is a 
special case. He included in his results expressions for the variances and 
covariances. Campbell’ made use of the distribution of lengths of run 
relative to the median. 

3. The next step was to obtain the distribution of possible arrange- 
ments with runs of at least a given length relative to the median. Mood® 
gave a general analysis of the problem, which was supplemented in a 
form more easily comprehensible to the engineer by Mosteller.* Mosteller 
gave criteria based on sample size at given probability levels for length 
of run on one side and on either side of the median. While this paper was 
in preparation, Olmstead had been examining the problem of the prob- 
ability of arrangements with runs of at least a given length on each side 
of the median. When this was brought to Mosteller’s attention, his paper 
was revised to include this case which had its inception in the engineering 
idea that if two cause systems were operating in separate periods they 
would be likely to produce separate groups of high and low values. 

4. Following this, attention was given to the distribution of arrange- 
ments, as indicated in Section V of this paper, where division for runs 
above and below was made at some location other than the median. 
Validity in use of the probabilities calculated on this basis was dependent 
on the choice of division location prior to the test and often left the en- 
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gineer and the statistician uncertain concerning the risks that were being 
taken when the division location was chosen after looking at the data. 
Because of assumption (a) above, this did not worry the engineer as 
much as it did the statistician, particularly when the engineer could find 
a cause associated with long runs identified in this way. The fact that 
he usually found such a cause indicated that some other way of consider- 
ing the problem from the viewpoint of mathematical statistics would be 
fruitful. 

5. The obvious next step was to find a procedure for counting all of 
the possible arrangements of n numbers, no two alike, that would have 
one or more runs of length at least s on each side of at least one of all of 
the possible division points that do not coincide with one of the numeri- 
cal values in the sample. One way of doing this is first to write down or 
plot all (7!) possible arrangements of the n numbers. Assume that the 
numerical values of the numbers are the y-coordinates and the order 
in which they occur in an arrangement is indicated by the x-coordinates 
of such a plot. All such plots could then be examined to see what y-divi- 
sion not at one of the y-values would give the longest run of consecutive 
y-values on each side of the division. In this way, each arrangement 
would be assigned to a category where a particular length of run was 
equalled or exceeded on each side for at least one of the possible y-divi- 











LENGTH OF RUN,S 














2000 


SAMPLE SIZE,N 


Fig. 1 — Length, s, of run on one side of median versus sample size, n, for se- 
lected values of probability, P. 
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Fie, 2 — Length, s, of run on each side of median versus sample size, n, for 
selected values of probability, P. 
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Fic. 3 — Length, s, of run on either side of median versus sample size, n, for 
selected values of probability, P. 


sions. The process presented in Section VI is the mathematical equivalent 
of carrying out such a count. This process is gratifying to the engineer 
and the statistician alike because of the freedom permitted in setting the 
division location after examining the data so as to obtain the longest 
lengths of run on each side of the selected value. Use of this information 
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Fie. 4 — Length, s, of run on each side for any cut versus sample size, n, for 
selected values of probability, P. 


was made first in an article by Walker and Olmstead.° Its part in de- 
tecting the type of an assignable cause appeared first in an article by 
Olmstead.’ 

6. In connection with the investigation undertaken for this paper, 
the asymptotic relationships for determining probabilities when n and s 
are large have been obtained (Section VII) and the results compared 
with those given by the exact relationships. The exact relationships ap- 
plying to the median have been calculated for sample sizes of 60, 100, 
and 200 extending this information beyond the range usually covered 
by research workers. For the convenience of such workers, four charts 
(Figs. 1, 2, 3, and 4) have been prepared to show the relationships be- 
tween s and n for P = 0.01, 0.10, 0.50, 0.90, and 0.99 for the primary 
types of runs. 


III. WORKING TECHNIQUES 


As just mentioned, Figs. 1, 2, 3, and 4 present graphically five per- 
centage points of each of the four “above” and (or) “below” run dis- 
tributions for all sample sizes from 10 to 2,000. The same information is 
furnished in tabular form in Tables I, II, III, and IV. How these are de- 
rived and calculated is discussed later (Sections V, VI, and VIII). Spe- 
cifically, the four types of distribution thus made available are: 

a. The probability, P, of the event that the length of the longest run 
on one pre-chosen side of median equals or exceeds s; if above, the prob- 
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ability is designated P(s/—, medium); if below, P(—/s, median). The 
notation, P(s/—, median) may be read — the probability that an ar- 
rangement will contain a run of length at least s above the median. 

b. The probability of the event that the length of the shorter of the 
longest run above and the longest run below the median equals or ex- 
ceeds s: designated P(s/s, median), where s/s means that there is a run 
of length at least s, on each side of the median. 

c. The probability, P, of the event that the length of the longer of 
the longest run above and the longest run below the median equals or 
exceeds s: designated P(s, median), where s means the longer of (s/— 
median) and (—/s, median). 

d. The probability, P, of the event that the length of the shorter of 


TABLE I 


Minimum sample sizes, n, that exceed selected probabilities, P, for a 
given length, s, of run on one side of median calculated from Table 
XVI and equations (23) and (27) to three significant. figures. 


Probability, P 


Run Length 
s 
0.01 0.10 0.50 0.90 0.99 
1 2 2 2 2 2 
2 4 4 6 8 12 
3 6 6 12 22 38 
4 8 10 22 54 100 
5 10 16 46 116 230 
6 14 26 92 260 490 
7 18 44 182 530 1044 
8 26 78 360 1104 2140 
9 38 142 714 2240 4370 
10 56 256 1424 4530 8980 
11 86 480 2850 9190 18240 
12 140 930 5680 18540 37200 
13 234 1838 11330 37600 75500 
14 410 3630 22700 75700 151700 
15 748 7160 45300 151700 303000 
16 1446 14190 90600 303000 607000 
17 2830 28100 181200 607000 1214000 
18 5530 56100 362000 1214000 2430000 
19 10860 117300 725000 2430000 4850000 
20 21500 235000 1450000 4850000 9710000 


Examples of use: 














Observed Data Probability, P 


Case 1 n = 96 s= 4 0.90 < P < 0.99 
2 54 10 P < 0.01 
3 56 10 0.01 < P < 0.10 


TABLE IT 


Minimum sample sizes, n, that exceed selected probabilities, P, for a 
given length, s, of run on each side of median calculated from Table 
XVI and equations (24) and (27) to three significant figures. 

















Probability, P 
Run Length 
s 
0.01 0.10 0.50 0.90 0.99 
1 2 2 2 2 2 
2 4 4 6 10 14 
3 6 8 14 26 44 
4 8 14 30 ; 68 116 
5 12 26 68 152 252 
6 20 50 140 322 552 
7 34 98 290 676 1164 
8 62 194 596 1390 2390 
9 116 390 1208 2830 4930 
10 216 782 2440 5650 10140 
11 446 1182 4910 11750 20700 
12 884 2360 9840 23800 42500 
13 1762 4720 19890 48600 86700 
14 3510 9450 39900 98600 174200 
15 6990 18900 80500 197300 3848000 
16 13930 37800 161300 395000 697000 
17 27900 75600 323000 789000 1394000 
18 55500 151200 645000 1578000 2790000 
19 111000 3802000 1290000 3160000 5570000 
20 222000 605000 2580000 6310000 11150000 
TaBLeE IIT 


Minimum sample sizes, n, that exceed selected probabilities, P, for a 
given length, s, of run on either side of median calculated from Table 
XVI and equations (25) and (27) to three significant figures. 














tas Probability, P. 
Run Length 
s 
0.01 0.10 0.50 0.90 0.99 

1 2 2 2 2 2 
2 A 4 4 8 10 
3 6 6 8 16 28 
4 8 8 16 36 64 

5 10 14 30 76 1386 

6 12 20 58 152 282 

7 16 32 106 296 568 

8 22 52 200 580 1150 
9 32 86 388 1174 2310 
10 42 150 758 2350 4640 
11 62 262 1488 4720 9330 
12 94 500 2920 9460 18730 
13 156 962 5860 10660 37700 
14 256 1876 11250 21300 75700 
15 418 3670 22600 42600 151600 
16 766 7330 45200 85300 303000 
17 1472 14090 90100 170500 606000 
18 2860 27900 180300 341000 1213000 
19 5570 55500 361000 682000 2430000 
20 10860 111100 721000 1364000 4850000 
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TaBLE IV 
Minimum sample sizes, n, that exceed selected probabilities, P, for a 
given length, s, of run on each side of any cut calculated from Table 
XVI and equations (26) and (27) to three significant figures. 



































Probability, P 
Run Length 
Ss 
0.01 0.10 0.50 0.90 0.99 
1 2 2 2 2 2 
2 4 4 6 8 12 
3 6 8 12 22 34 
4 8 12 22 48 . 1 
5 12 18 46 96 - 162 
6 16 34 86 192 380 
7 24 58 166 382 668 
8 38 108 324 760 1342 
9 66 204 638 1518 2690 
10 118 400 1266 3030 5410 
11 228 790 2530 6070 10870 
12 444 1568 5050 12130 21500 
13 878 3130 10070 24300 43100 
14 1750 6220 20100 48500 86200 
15 3480 12490 40300 97000 172300 
16 6790 25000 80600 194100 345000 
17 13860 49900 161100 388000 689000 
18 27700 99900 322000 776000 1379000 
19 55400 199800 644000 1553000 2760000 
20 110800 400000 1289000 3110000 5510000 
TABLE V 
Speedometer readings at one minute intervals. 

Time MPH Time MPH Time MPH Time MPH 
1 48 15 55 29 52 43 60 
2 50 16 53 30 58 44 58 
3 48 17 48 31 55 45 55 
4 50 18 50 32 57 46 57 
5 52 19 50 33 58 47 57 
6 49 20 55 34 58 48 53 
7 50 21 55 35 58 49 57 
8 47 22 55 36 58 50 58 
9 51 23 55 37 58 51 58 

10 50 24 55 38 58 52 56 
11 49 25 51 39 55 53 58 
12 52 26 53 40 56 54 63 
13 53 27 52 41 57 55 60 
14 53 28 51 42 56 56 50 
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the longest run above and the longest run below a cut chosen to maxi- 
mize this length equals or exceeds s: P(s/s, any cut) with meaning similar 
to that for P(s/s, median) but for the case where the cut has been 
chosen to maximize the shorter of the longest runs on each side. 

The use of these distributions can be illustrated by the calculation of 
the various run length statistics for a specific example. The 56 speedom- 
eter readings presented in Table V and Fig. 5 were observed at one 
minute intervals during a driver’s first trip on a toll highway with 
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Fig. 5 — Readings at one-minute intervals. 
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Fig. 6 — Chart for deviations from trend line. 
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separate traffic lanes. In this instance, nine observations occur at the 
median (55) with 22 above and 25 below. This is not unusual in experi- 
mental work where ties are likely at or near the median. (It should be 
pointed out that the occurrence of ties makes this a difficult example. 
Later, this example will be modified by removing a trend and then it 
will be simpler. Consideration will first be given to runs with respect to 
the median and then to ‘‘any cut.’’) Various methods of resolving such 
ties are possible. The most conservative is to use a tied median to termi- 
nate a run. The least conservative is to use the tied median or medians 
for inclusion in the run. Intermediate between these is to consider all 
possible allocations and their effects on run length. Here, in order to ob- 
tain 28 above and 28 below the median, it is necessary to allocate the 
nine tied at the median so that 6 will be above and 3 below. The run 
length associated with each such combination would then be obtained 
and, if desired, the average computed. In this case, the lengths of the 
various runs obtained by these three methods are as follows: 

















Run Lengths, s Per 

es Cent 

Type of Run i Limit for P S 0.01 Below 

Most Con} average ficast Com Limit 

Above 7 167 si. C18 11 (Table I) 33 
Below 14 15.8 21 11 (Table I) 0 
Each Side 7 12.8 18 8 (Table IT) 1 
Either Side 14 16.6 21 11 (Table ITI) 0 
Each Side, Any Cut 14 — — 9 (Table IV) — 








It will be observed that only one answer results for the ‘‘each side of 
any cut.’ Also, three of the five tests on the most conservative basis are 
above their respective limits for a P of 0.01 and all on the other bases. 
This happens quite frequently in engineering problems. 

‘It is apparent, however, in this case, that there is a consistent trend 
throughout the set of data. In Fig. 6, this has been removed and the 
median lies between the 28 points above and the 28 points below. The 
following statistics are obtained: 





Run Lengths, s 








Type of Run P for Observed Run 
Observed Limit for 
Run Ps 0.01 
Above 9 11 (Table I) 0.03 
Below 5 11 (Table I) 0.60 
Each Side 5 8 (Table IT) 0.42 
Either Side 9 11 (Table ITI) 0.05 
Each Side, Any Cut 9 9 (Table IV) 0.008 
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In this case, only the statistic for the longest run on each side of any 
cut has a P as low as 0.01. Two others were of the order of 0.05 and the 
remaining two near 0.50. The explanation of the indicated nonrandom- 
ness was identified with human behavior under conditions of learning. 

This example raises a question about the treatment of odd sized sam- 
ples, where the median is a single observation. These may all be reduced 
to even sized samples by omitting the median. This is unnecessary in the 
case of the longest run on “each side of any cut” where the P values for 
a given s for the odd sized sample lie between those for the adjacent even 
sized samples. 


IV. SOME SPECIFIC SAMPLES 


Table VI presents the values of probabilities P(s/—, m/ne) and 
P(—/s, m/n2), for every possible separation of 10 = m + me observa- 
tions into 7 on one side of a cut and nz on the other. Table VII does the 
same for 20 = m, + m2 observations. Similarly, the values of P(s/s, m1/n2) 
and P(s/— or —/s, m/nz) are given in Tables VIII and IX, and Tables 
X, and XI, respectively. | 

In Tables XII, XIII, and XIV, the table presented by Mosteller* for 
the three kinds of runs with respect to the median, that is, where m1 =z, 
has been extended to include samples of 60, 100, and 200. 

The values of P(s/s, any cut) for n = 10, 20, 40, and 100 are given in 
Table XV. It will be noted that the values of the probabilities in this 
table differ only slightly from those in Table XIII for P(s/s, median). 
For large sample sizes, other considerations suggest that the s-values 


TABLE VI 


Probability of an arrangment with a run of length at least s on “one 
side” of a demarcation value for ny + nz = 10 calculated from equation 


(1) or (2). 


Total on the “‘one side’, i.e., m1 or m2 

















Length of Run 
s 
9 8 7 6 5 4 3 2 1 
1 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 
2 1.000 | 1.000 | 1.000 | 1.000 | 0.976 ; 0.833 | 0.533 | 0.200 
3 1.000 | 1.000 | 0.967 | 0.786 | 0.500 | 0.233 | 0.067 
4 1.000 | 0.933 | 0.667 | 0.357 | 0.148 | 0.033 
5 1.000 | 0.667 | 0.333 | 0.119 | 0.024 
6 0.800 | 0.400 | 0.133 | 0.024 
7 0.600 | 0.200 | 0.033 
8 0.400 | 0.067 
9 0.200 














TaBLe VII 


Probability of an arrangement with a run of length at least s on “one side” of a demarcation value for 
m + m2 = 20 calculated from equation (1) or (2). 


Length of Run s 


WONT OWN 








18 


16 





pommel femme mh fom food fesh fm feed ped, 
S 
fm) 
Oo 


1.000 
1.000 
1.000 
1.000 
1.000 
1.000 
0.995 
0.947 
0.853, 
0.711 
0.568 
0.442 
0.332 
0.237 
0.152 
0.095 
\0.047 
0.016 








1.000 
1.000 
1.000 
1.000 
1.000 
0.982 
0.898 
0.751 
0.579 
0.421 
0.295 
0.196 
0.123 
0.070 
0.035 
0.014 
0.004 


1.000 
1.000 
1.000 
1.000 
0.986 
0.889 
0.707 
0.509 
0.341 
0.217 
0.130 
0.072 
0.036 
0.015 
0.005 





0.001 


0.098, 





15 





1.000 
1.000 
1.000 
0.996 
0.920 
0.721 
0.492 
0.307 
0.179 


0.049 
0.022 
0.008 
0.002 
0.000 





Total on the ‘‘one side’’, i.e., 21 or 22 


14 13 12 11 10 9 





8 








1.000)1 .000)1 .000/1 .000)1 .000/1 .000 
1.000/1 .000}1 .000/1 .000)1 .000;0.999 
1.000,0.999 0.988 0.950/0.870/0.742 
0.971,0.900,0.779|0.622/0.457/0.307 
0.790,0.621/0.447/0.295)0.178|0.098 
0.527|0.351 0.214/0.119,0.060/0.026 
0.309,0.177|0.092/0.043/0.017|0.006 
0.167 0.082,0.035 0.013,0.004/0.001 
0.083 0 .034'0.012:0.003 0 .001,0.000 
0.038 0.012,0.003)0.001/0.000 
0.01510.004/0.001/0.000 
0.005)0.001|0.000 

0.001/0.000 

0.000 





1.000 
0.990 
0.582 
0.187 
0.047 
0.009 
0.001 
0.000 





1.000 
0.956 
0.415 
0.101 
0.019 
0.005 
0.000 





1.000 
0.871 
0.260 
0.046 
0.006 
0.000 





1.000 
0.718 
0.140 
0.017 
0.001 





1.000 
0.509 
0.060 
0.004 





1.000 
0.284 
0.016 














1.000 
0.100 





1.000 
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TasBuLeE VIII 


Probability of an arrangement with a run of length at least s on “each 


(4). 


Length of 
Run 
s 


oR WD 


ny OF Ne 
ne Or m1 


side” of a demarcation value for 2 + ne = 10 calculated from equation 

















1 2 3 4 5 

9 8 7 6 5 
1.000 1.000 1.000 1.000 1.000 
0.200 0.533 0.833 0.960 
0.067 0.224 0.333 
0.029 0.056 
0.008 

TABLE IX 


Probability of an arrangement with a run of length at least s on ‘each 
side” of a demarcation value for n; + ne = 20 calculated from equation 





1.000; 1.000} 1.000) 1.000) 1.000 


(4). 
Length 1 
ee horm| 19 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 


0.100} 0.284) 0.509] 0.718 
0.016} 0.060} 0.140 
0.004] 0.017 

0.001 





TABLE X 








1.000} 1.000 
0.958} 0.990 
0.413] 0.581 
0.100; 0.179 
0.012) 0.042 
0.002) 0.007 
0.000} 0.001 
0.000 














1.000) 1.000 
0.999} 1.000 
0.727| 0.784 
0.245) 0.274 
0.056) 0.064 
0.011} 0.013 
0.002) 0.002 
0.000} 0.000 
0.000) 0.000 

0.000 








Probability of an arrangement with a run of length at least s on ‘either 
side” of a demarcation value for mn, + n2 = 10 calculated from equation 














(5). 

Length of “ 1 2 3 4 
po ann 9 8 7 6 3 
1 1.000 1.000 1.000 1.000 1.000 
2 1.000 1.000 1.000 1.000 0.992 
3 1.000 1.000 0.967 0.795 0.667 
4 1.000 0.933 0.667 0.362 0.230 
5 1.000 0.667 0.333 0.119 0.040 
6 0.800 0.400 0.183 0.024 
7 0.600 0.200 0.033 
8 0.400 0.067 
9 0.200 











TaBLe XI 


Probability of an arrangement with a run of length at least s on “either 
side” of a demarcation value for n; + ne = 20 calculated from equation 






































(5). 

Length 

eon ot 9 i: Fa tb : vi e i ti 7 
1 1.000} 1.000} 1.000} 1.000} 1.000; 1.000] 1.000) 1.000) 1.000) 1.000 
2 1.000} 1.000} 1.000} 1.000} 1.000; 1.000} 1.000) 1.000; 1.000] 1.000 
3 1.000} 1.000} 1.000] 1.000] 1.000} 1.000) 0.999} 0.989) 0.966} 0.956 
4 1.000; 1.000) 1.000} 1.000} 0.996; 0.971] 0.901] 0.787) 0.684] 0.640 
5 1.000) 1.000} 1.000! 0.986} 0.920) 0.790} 0.622} 0.452) 0.337) 0.293 
6 1.000} 1.000) 0.982) 0.889] 0.721] 0.527} 0.351] 0.217) 0.134! 0.106 
7 1.000) 0.995] 0.898] 0.707) 0.492) 0.309! 0.177) 0.092) 0.046) 0.032 
8 1.000) 0.947] 0.751) 0.509] 0.307! 0.167) 0.082) 0.035) 0.014! 0.007 
9 1.000] 0.853} 0.579) 0.341!) 0.179) 0.083} 0.0384] 0.012) 0.003) 0.001 
10 1.000] 0.711} 0.421] 0.217) 0.098! 0.038} 0.012) 0.005) 0.001) 0.000 
11 0.900] 0.568} 0.295} 0.180) 0.049] 0.015} 0.004} 0.001) 0.000 
12 0.800} 0.442} 0.196} 0.072} 0.022) 0.005) 0.001] 0.000 
13 0.700} 0.332} 0.125) 0.036) 0.008) 0.001; 0.000 
14 0.600} 0.237) 0.070) 0.015) 0.002} 0.000 
15 0.500) 0.158} 0.035} 0.005] 0.000 
16 0.400} 0.095} 0.014] 0.001 
17 0.300} 0.047} 0.004 
18 0.200} 0.016 
19 0.100 

TABLE XII 


Probability of an arrangement with a run of length at least s on “one 
side” of median calculated from equation (1) or (2). 


Length of Run 
s 


SONNE WNHe 


22 or over 


Sample size, n 





10 


20 


40 





1.00000 
0.97619 
0.50000 
0.14286 
0.02381 








1.00000 
0.99994 
0.86973 
0.45713 
0.17849 
0.05960 
0.01703 
0.00395 
0.00065 
0.00006 


1.00000 
1.00000 
0.99225 
0.79885 
0.44954 
0.20733 
0.08697 
0.03438 
0.01290 
0.00458 
0.00153 
0.00047 
0.00014 
0.00004 
0.00001 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 





60 


1.00000 
1.00000 
0.99956 
0.92695 
0.63645 
0.33935 
0.15952 
0.07046 
0.02996 
0.01235 
0.00494 
0.00192 
0.00072 
0.00026 
0.00009 
0.00003 
0.00001 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 





100 


1.00000 


1.00000 
1.00000 
0.99049 
0.84289 
0.54439 
0.29185 
0.14251 
0.06642 
0.03015 
0.01344 
0.00589 
0.00255 
0.00108 
0.00045 
0.00019 
0.00008 
0.00003 
0.00001 
0.00000 
0.00000 
0.00000 


200 


1.00000 
1.00000 
1.00000 
0.99994 
0.98093 
0.82160 
0.54174 
0.30295 
0.15529 
0.07621 
0.03656 
0.01731 
0.00813 
0.00378 
0.00175 
0.00080 
0.00037 
0.00017 
0.00007 
0.00003 
0.00001 
0.00000 
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TasBLe XIII 


Probability of an arrangement with a run of length at least s on “each 


side” of median calculated from equation (6) or (4). 





Length of Run 
s 


WONOOFWNE 


16 or over 


10 





20 





1.00000 
0.96032 
0.33333 
0.05556 
0.00794 





Sample Size, n 


40 


60 





100 


200 








Probability of an arrangement with a run of length at least s 
side” of median calculated from equation (5). 


Length of Run 
AY 


oo 
KS OOONOOPWNE 


22 or over 


10 


1.00000 


0.99206 
0.66667 
0.23016 
0.03968 

















1.00000 1.00000 1.00000 1.00000 1.00000 
0.99989 1.00000 1.00000 1.00000 1.00000 
0.78582 0.98519 0.99912 1.00000 1.00000 
0.27412 0.66809 0.88729 0.98159 0.99987 
0.06356 0.24933 0.44250 0.72496 0.96284 
0.01288 0.06820 0.14723 0.33308 0.68619 
0.00249 0.01647 0.03992 0.10591 0.31877 
0.00045 0.00379 0.00992 0.02919 0.10573 
0.00008 0.00085 0.00238 0.00747 0.03027 
0.00001 0.00019 0.00056 0.00185 0.00800 
0.00004 0.00013 0.00045 0.00203 

0.00001 0.00003 0.00011 0.00051 

0.00000 0.00000 0.00002 0.00013 

0.00000 0.00000 0.00000 0.00003 

0.00000 0.00000 0.00000 0.00001 

0.00000 0.00000 0.00000 0.00000 

TABLE XIV 
on “either 
Sample Size, n 
20 40 60 100 200 

1.00000 1.00000 1.00000 1.00000 1.00000 
0.99999 1.00000 1.00000 1.00000 1.00000 
0.95564 0.99931 1.00000 1.00000 1.00000 
0.64014 0.92961 0.98660 0.99938 1.00000 
0.29342 0.64975 0.83041 0.96082 0.99901 
0.10632 0.34646 0.538147 0.75569 0.95701 
0.03157 0.15747 0.27911 0.47779 0.76970 
0.00741 0.06497 0.13100 0.25582 0.50017 
0.00122 0.02495 0.05754 0.12538 0.28031 
0.00011 0.00897 0.02414 0.05846 0.144438 
0.00302 0.00975 0.02642 0.07108 

0.00093 0.00380 0.01168 0.03411 

0.00028 0.00144 0.00506 0.01613 

0.00008 0.00052 0.00216 0.00753 

0.00002 0.00018 0.00090 0.00349 

0.00000 0.00006 0.00038 0.00160 

0.00000 0.00002 0.00016 0.00074 

0.00000 0.00000 0.00006 0.00034 

0.00000 0.00000 0.00002 0.00014 

0.00000 0.00000 0.00000 0.00006 

0.00000 0.00000 0.00002 

0.00000 0.00000 0.00000 
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TABLE XV 
Probability of an arrangement with a run of length at least s on “each 
side”’ of at least one of all possible demarcation values calculated from 
equation (22). 

















Sample Size, n 
Length of Run 
s 
10 20 40 100 

1 1.00000 1.00000 1.00000 1.0000 

2 0.97937 0.99997 1.00000 1.0000 

3 0.46190 0.89748 0.99713 1.0000 

4 0.08413 0.44121 0.83760 0.9986 

5 0.00794 0.12994 0.43401 0.9125 

6 0.02943 0.15840 _ (0.5863) * 

7 0.00559 0.04544 (0.2561 )* 

8 0.00093 0.01179 0.0876 

9 0.00013 0.00277 0.0263 
10 0.00001 0.00066 0.0073 
11 0.00015 (0.0020)* 
12 0.00003 (0.0005) * 
13 0.00001 (0.0001 )* 
14 or over 0.00000 (0.0000) * 








* Values in parentheses were interpolated or extrapolated. 


will increase by unity. All this is in accord with the experience of the 
engineer who did not hesitate to use available information for P(s/s 
median) as being a good first approximation to P(s/s, any cut). 


V. SAMPLE ARRANGEMENT DISTRIBUTIONS WITH RUNS OF LENGTH AT 
LEAST S§ ABOVE AND BELOW ANY SELECTED CUT 


Assume a finite sample of n = m -++ me numbers, of which n, have 
the common property of being above the selected cut and, similarly, 
nm, are below. Clearly, the ne numbers may be considered as providing 
(nz + 1) cells or partitions of the n; numbers above. Some of these cells 
or partitions will, of course, be empty, particularly when n is less than 
(no + 1). If at least s of the n; numbers are to be in one partition, it 
would first appear that the number of ways would be proportional to 
the number of possible partitions, nz -+ 1, and also to the number of ways 
in which the partition boundary points, ne, may be selected from the 
remaining numbers, n — s, i.e., the combination of (n — s) things taken 
nm, at a time. This, however, gives an over-estimate because it counts 
twice each arrangement that has two partitions of s each, three times 
for each arrangement that has three partitions of s each, etc. Taking 
these factors into account, it is found that the number of ways of par- 
titioning the n; numbers by means of the mz numbers so as to obtain one 
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or more partitions that contain s or more elements is: 
[+] cath 

» e 1) % + ‘) @ — ") 

7=1 j Ne 


Having this, we may write down the probability of an arrangement of 
n numbers that will contain at least one run of length s or more among 
the nm numbers that are above our demarcation value by dividing by 


(.): 


Po/imind = Sen (™ED (Mma) 


n J ne 7 
(i 


In a similar manner, we may, by interchanging n; and nz, write down 
the probability of an arrangement of n numbers that will contain at: 
least one run of length s or more among the m2 numbers that are below 
our demarcation value: 


ne 


Piamind = AE COM (ENOL. 


(.) 


To assist in determining the probability that an arrangement will 
contain at least one run of length s or more on each side of the demarca- 
tion value, let us assume that we have partitioned the ni numbers above 
into r runs of which at least one is of length at least s. These r runs may 
be associated with (r — 1) runs or partitions of the m2 in only one way, 
with (r + 1) runs of the mm in only one way, but with r runs of the nz 
in two ways. Each of these sets of possible runs must contain at least 
one run of length s or more. The resulting partitioning count for s, n , 
and r is: 


* Some readers may wish to note that 


(",) P(s/—, m1/nz) 


is the coefficient of 2% in the expansion of (1+ 2+ 22+ ---)mtt— (1 +a¢+ 
oe a gs-})neth 
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Es | 
Ce aes (—1)# (5) « =D = je= * fori =1,2 (3)* 
j=1 Jj Cal 
and B(nz,r — 1) and B(n. , r + 1) are obtained by substituting (r — 1) 
and (r + 1) respectively for r in (8). All that is needed to secure the 
desired probability is to find the count of the possible arrangements in 
both n; and nz corresponding to each 7, sum with respect to r and divide 
by the total possible arrangements: 


ny—stl 


P(s/s, m/n2) = =e > Bia ,r)[Blr,r — 1) + 2B(nz , 7) 


@) ; 
+ Bin,r + 1). 


To find the probability that an arrangement will contain at least one 
run of length s or more on ezther side of the demarcation value, it should 
be noted that (4) is counted in both (2) and (1). Thus, this probability 
is simply: 

P(s/— or —/s, m/n2) = P(s/—, n/n) 
+ P(—/s, m/m) — P(s/s, m/nz) 
where the probabilities on the right hand side of (5) are given by (1), 
(2), and (4) respectively. 
When the median is used as the demarcation value, mn, = nz, so that 


P(s/—, median) = P(—/s, median). In addition, by rearranging terms, 
P(s/s, median) may be written in the simplified form: 


(5) 


ny—stl 


P(s/s, median) = (2) a [Bim ,r) + Bu, r+ DP (6) 
ny 


where B(n, , r) and B(m , r + 1) are defined by (8) as before. Equation 
(6) has been used for the new calculations reported here. (See Section IV) 


VI. SAMPLE ARRANGEMENT DISTRIBUTIONS FOR RUNS OF LENGTH S OR 
MORE ON EACH SIDE OF AT LEAST ONE OF ALL POSSIBLE DEMARCATION 
VALUES 


When this derivation was first discussed with a mathematical statis- 
tician, he questioned whether anyone would want a criterion based on 
* Here, B(n;, r) = B(ni, 1, s) is the coefficient of 2% in 
Cr ae al can A a a a ees a a i Bi 
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such a distribution. To make it clear that the engineer does want it, 
assume that we have a set of data and Tables VI to XI, inclusive. The 
engineer might look for the longest run on either side of the median. 
Having found it, he might pick a demarcation value that would just 
include this run. This would give him, for instance, 


m + Ne 


Ne < 9 





on one side of his demarcation value and 


m+ Ne 
2 





mM > 


on the other side. He might then look for the longest run on the 7; side. 
This would give him two long runs that might be equal in length or one 
shorter than the other. In either case, he could obtain a value of s for 
the length of run that is equalled or exceeded on each side of his de- 
marcation value. If his total sample happened to be 20, he could obtain 
P(s/s, 1i/ne) from (4) or Table IX for m, nz, and s. This probability, 
however, is based on his having chosen n and m2 before the experiment 
and therefore does not indicate what the true probability associated 
with this process is. At the same time, it is reasonably certain that this 
is a procedure that many engineers would be inclined to follow if they 
did not have prior knowledge concerning where to set the demarcation 
value. 

To facilitate the solution, it will be assumed that no two of the n 
values in a sample of size n are identical. For the analysis given here, 
n is taken to be even. Study of small samples shows that when n is odd, 
P(s,n — 1)S P(s,n) S P(s,n + 1). Taking (6) (with the median as 
initial cut) as a starting point, assume that the demarcation value is 
moved so that (m: + 1) values are on one side and (m — 1) values on 
the other. This adds a fraction of the total arrangements with runs of 
length s or more on each side of the new demarcation value equal to: 


AiP(s/s,m + 1/m — 1) 
1 n1—s 
eo Oa = LA ee = 1) 
(mn < 1) as ; ce @) 


+ 2A(m + 1,7) + A(m + 1,7 4+ 1)] 
where Bin — 1, r) is given by (8) above and 
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r—1 
Atm + 1,r) =r 2 (-1)'fr +s — (41 + 1) 
LG LDEAD ? ; : 
[fm +1 —-s -— js — 1) 
r—l 
_ {mt — 2s — js — 1) 

eo 


teen Eco 5’) 


oes) 
rf 
(lame ee) 
r 


The essential points in the derivation of (7) and (8) may be perceived 
most easily by considering some typical computations. Suppose that 
we wish to derive A,P(4/4, 6/4), having previously derived all of the 
values of P(s/s, 5/5) from (6). The possible combinations with a run of 
at least 4 on each side of a cut with 6 above and 4 below have the follow- 
ing orders: 

1. 6 above and 4 below, or 4 below and 6 above, 

2. 5 above, 4 below, and 1 above, or 1 above, 4 below, and 5 above, and 

3. 4 above, 4 below, and 2 above, or 2 above, 4 below and 4 above. 

The simplest of these is the first. Starting with the value of P(4/4, 5/5) 
as given by (6), we now wish to determine how much additional proba- 
bility is associated with moving the cut from the median to a point where 
6 are above and 4 are below. Since there are 6 possible locations in the 
new arrangement for the value that was moved from below to above 
the cut and (3) ways for arranging 6 above and 4 below, the total pos- 
sible combinations of these provides the factor given in the denominator 
of (7), in this case 6(’3). Since there is only one combination possible 
for 4 items taken 4 at a time, B(4, 1) as given by (8) is as might be ex- 
pected unity. Then, since we must have at least one run above the cut, 
A(6, 0) must be 0. The first important question relates to the value of 
A(6, 1). Since there is only one run of 6, it is casy to see that a run of 
length 4 or more must have occurred above the median if the value 
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moved from below is now in position 1, 2, 5, or 6 in the new run. Hence, 
there are only two possible locations for the value moved that give new 
combinations that have not been counted with respect to the median. 
At this point, it will be observed that for this case, r = 1, this value is 
given by 2s — (m + 1). Since there is exactly one run on each side of 
the new cut, the coefficient 2 appears before the A(m + 1, r) in (7) to 
take account of the two ways that these runs may be arranged, namely, 
6 above followed by 4 below and 4 below followed by 6 above. 

Now consider the ways in which we may have two runs with the 
restriction that one must be of length 4 or more. This is to be given by 
A(6, 2). In this case, there are two such run combinations, one with runs 
of lengths 5 and 1, and one with runs of lengths 4 and 2. Obviously, the 
value that was moved could not have been in the short run in either case 
because these arrangements would have had long runs of length 4 or 
more that would have been counted with respect to the median. In the 
case of the run of length 5, it could not be on either end but in the run 
of length 4, it could be at any one of the positions in the run. We also 
observe that with two runs of dissimilar lengths, the positions of the 
runs may be interchanged. This gives in this case a factor 2. Hence, we 
find that A (6, 2) is2-3 + 2-4, or 14. Toconform with (8), thissum would 
have to be written as 2-3-2 + 2-1, although, at this point, it may not be 
clear that this is a reasonable thing to do. However, by extending the 
investigation step by step, it is found that the various terms in (8) are 
required. Specifically, the 7 becomes necessary when 1 + 1 becomes 
greater than 2s — 1 and the binomial coefficients with terms in 2s are - 
introduced so that any combination that already has a run of length s 
on the basis of the median will not be counted again. 

Obviously, this process may be continued by moving the cut to in- 
clude (mn; + 2) values on one side and leave (m1 — 2) values on the other. 
Proceeding in this way, the fraction added in going from (m + 7 — 1) 
values above and (nm. — 7 + 1) values below to (m + 7) above and 
(m — 12) below is given by: 


A;P(s/s, ™ + i/m a 1) 


1 (ny-i)=(s—1) 


SS Se ee > Blu — i,r) 
(ny +. i) e 2m ) r=1 (9) 


1— 4 
‘{A(m + i,7r — 1) + 2A(m + 7,7) + A(m + 7,7 + 1)) 


where B(n — 1, 7) is defined by (8) above and 
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Atm +47) =D (-Dir te — mtd + G+ D6 - DI 


CE MC ae) 


_ (mt inte) gn Bene( 5) 


ie +7i— a qs — 2) i « —1t— . qs — » i 


One of each of these A’s is added in going from the median to each 
side. Therefore, the desired probability of an arrangement with runs 
of length s or more on each side of at least one of all possible demarcation 
values is: 


(10) 


nN 1—Ss 


P(s/s, any cut) = P(s/s, n/m) + 2 pe AiP(s/s,m1 + i/m — 7). (11) 


VII. ASYMPTOTIC DISTRIBUTIONS 


Intuitively, the asymptotic distribution of arrangements with 0, 1, 2 
etc., runs of length s or more for mi/n = e, a constant, would be ex- 
pected to become Poisson Exponential as n becomes large. Referring to 
Mood,’ the expected number of runs of length s or more on one side of a 
demarcation value is his expression (3.13), which may be written: 


(8) 
E(ris) = (me + 1) a ~ ner es for n and nz large (12) 


where (ri;) is the expected number of runs of length s or more on the 
side of the cut designated 1; superscript (s) designates a factorial mo- 
ment, €.g., 


n® = n(n — 1)(n — 2) +--+ (n—8 +1) (13) 


and e, and é are written for n/n and n2/n, respectively. 
The variance is his expression (3.15), or 


at (ne + 1)? n,°” 


Orisrig ™ Oss ne 


(s) 
+ (m2 + 1) 2 
n's 
ny 
(1 — (n+ 1) ") (14) 


(2 s-l 2 
~ nee] — se es — @°) 


~ neyer = E(r.) for s,m, and ne large. 
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Corresponding expressions for the side designated 2 may be obtained 

by interchanging the subscripts, 1 and 2, in equations (12) and (14). 
Mood’ also derives an expression (3.18) for the covariance of num- 

bers of runs equal to or greater than specified lengths on the two sides 

of the demarcation value. For runs of length s or more on each side, this 

becomes: 

Ge aie ann (nm + 1) (ne + 1) n. 


n 2s) : nes-D ns) 6s) 
(15) 


2 
~ Nn een (see. — s + 1) 
2 1 1 
~n se e°" for s,m, and nz large. 


From (14) and (15), it is clear that the covariance between long runs 
on the two sides becomes negligible for s, mn , and nz large and the occur- 
rence of long runs on each side may be treated as independent. 

Since Mood’ has shown (his Theorem I) that the distribution of the 
number of runs of length s or more on one side is asymptotically normal 
and by (12) and (14) above, the first two moments are those of a Poisson 
Exponential, the asymptotic probabilities of arrangements with runs of 
length s or more may be approximated by: 


On side 1: 
| P(s/—, m/me) = 1 — oo; (16) 
On side 2: 
P(—/s, m/m) = 1 — & ?'; (17) 
On each side: 
P(s/s, m/m) = (1 — & "?")(1 — @"*2°"); (18) 
On either side: 
P(s/— or —/s, m/m2) = 1 — @ Meeeter? Stent), (19) 


When the median is being used as the demarcation value, that is, when 
€1 = é€, these become: 
On side 1 or on side 2 alone: 
P(s/—, median) = P(—/s, median) = 1 — ga aaa (20) 
On each side: 
P(s/s, median) = (1 — aaa (21) 
On either side: 


P(s/— or —/s, median) = 1—e"” ’. (22) 
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Asymptotic relationships of this type do not add much to the solution 
of the practical problem of calculating probabilities associated with 
samples of 100 or less. They do, however, suggest that doubling sample 
sizes for a given probability should increase s by unity. This is in close 
agreement with the calculations for finite sample sizes. This observation 
suggested the treatment in the next section. 


VIII. RELATIONSHIPS BETWEEN S$ AND 72 FOR CONSTANT PROBABILITY 


From (20), (21), and (22), it is clear that, for constant probability, s 
is asymptotically a simple function of n for each of the arrangement dis- 
tributions considered for runs relative to the median. Specifically, we 
obtain: 

On side 1 or on side 2: 
a log n — log (— loge dQ — P)) _ 


log 2 : 


where P = P(s/—, median) (23) 


= P(—/s, median); 
On each side: 
« = gm — log (— log. A — VP)) _ ; 
log 2 (24) 
where P = P(s/s, median); 
On either side: : 
ues log n — log (— log, (1 — P)) 
log 2 (25) 
where P = P(s/— or —/s, median). 





After considering equations (23), (24), and (25), it is quite obvious that 
an equation similar to (24) in the same way that (25) is similar to (23) 
could be written, 1.e.; 


fits log n — log (— log, (1 — ~/P)) 
log 2 (26) 
where P = P[(s/— or —/s)/(s/— or —/s), median] 
but what is the meaning of P? It is clear that the P in (26) is approxi- 


mately the square of the P in (25). So far, however, no analytic justifi- 
cation for (26) has been obtained, although the P in (26) is obviously 


TABLE XVI 


Constants for equation (27) calculated from equations (23) to (26) and 
tables VII to X 












































8 Differences at equal to 

g Table| P A B Cc 

Pst 10 20 40 60 100 200 

23 | VIT/0.001} 5.151 | 126.6 — 266.5 0 0 0 0 —0.01/+0.01 
0.01 4.863 53.19 |—105.1 0 +0.02)/—0.02/—0.01) 0 +0.02 
0.02 4.445 39.61 —79.16 0 +0.01)—0.01/—0.01} 0 +0.02 
0.025; 4.306 | 35.34 —71.08 0 +0.01/—0.01/—0.01; 0 -++0.02 
0.05 |; 3.127 28 .03 —57.71 0 +0.01/—0.01/—0.01| 0 +0.02 
0.10 3.127 13.95 —32.83 0 +0.01/—0.01/—0.01} 0 +0.01 
0.50 | 0.5297) —3.126 —0.1306) 0 0 0 0 0 0 
0.90 |—2.576 | —6.757 10.96 0 0 0 +0.01) 0 0 
0.95 |—3.442 | —7.244 13.84 0 0 0 0 0 0 
0.975)—4.227 | —7.185 15.44 0 0 0 0 0 —0.01 
0.98 |—4.441 | —7.326 16.29 0 —0.01; 0 0 0 —0.01 
0.99 |—5.128 | —7.164 17.78 0 —0.01/+0.01| 0 0 —0.01 
0.999} —7.829 0.3596 8.278 | 0 0 —0.01/—0.01!+0.06}—0.04 

24 | VIII/O.001}/—0.3002) 23.36 —32.54 0 —0.01/+0.02) 0 0 —0.01 
0.01 0.0467) 10.93 —13.30 0 0 0 0 0 0 
0.02 0.0048 8.596 | —11.14 0 0 0 0 0 0 
0.025;—0.0005; 7.612 —9.868 | 0 0 0 0 0 0 
0.05 |—0.0672| 4.695 —6.288 | 0 0 0 0 0 0 
0.10 |—0.2136 1.668 —2.139 |} 0 0 0 0 0 0 
0.50 |—1.573 | —4.142 6.576 | 0 0 0 +0.01} 0 —0.01 
0.90 |—3.660 | —6.518 13.04 0 0 0 0 0 —0.01 
0.95 |—4.408 | —6.591 14.75 0 0 0 0 0 —0.01 
0.975)/—5.039 | —6.665 16.21 0 0 0 0 0 —0.01 
0.98 |—5.218 | —6.728 16.72 0 0 0 +0.01; 0 —0.01 
0.99 |—5.822 | —7.068 18.19 0 0 0 +0.01) 0 —0.01 
0.999;—7.4380 | —6.861 23.26 0 —0.01} 0 +0.01) 0 —0.01 

25) IX/0.001;) 4.879 | 154.6 — 324.2 0 0 +0.01} 0 —0.02/-++0.01 
0.01 4,902 | 73.86 |—145.9 0 +0.01/—0.01/—0.01| 0 +0.02 
0.02 4.764 55.83 |—109.5 0 +,0.01)—0.01;—0.01) 0 0 
0.025; 4.611 51.88 {|—102.5 0 +0.01/—0.01/—0.01; 0 +0.02 
0.05 4.432 35.72 —71.31 0 +0.02/—0.02/—0.01} 0 +0.02 
0.10 3.847 25.64 —54.79 0 +0.01/—0.01/—0.01} 0 +0.02 
0.50 2.524 1.680 | —13.88 0 +0.01/—0.01| 0 0 +0.01 
0.90 1.141 | —9.694 7.601 0 0 0 0 0 0 
0.95 | 0.759 |—12.16 12.83 0 0 0 0 0 0 
0.975) 0.422 |—14.08 17.14 0 0 0 0 0 0 
0.98 0.356 |—14.74 18.61 0 0 0 0 0 0 
0.99 0.081 |—16.61 22.91 0 0 0 +0.01} 0 0 
0.999;—0.600 |—21.68 34.97 0 —0.01/4+0.02) 0 —0.02/+0.01 

26 X/0.001)—4.176 60.34. | —69.24 |+0.01]—0.01| 0 — |4+0.01;) — 
0.01 |—2.176 | 39.32 —49.98 /+0.01/+0.01/—0.01) — 0 — 
0.02 |—1.762 34.71 —47.65 0 +0.01/—0.02; — |—0.02;) — 
0.025) —1.427 32.01 —45.06 0 +0.02;—0.01; — {;—0.01; — 
0.05 |—0.830 26.26 —41.04 0 +0.02}—0.01; — -j—0.02) — 
0.10 |—0.356 ; 21.25 — 38.10 0 +0.01;—0.01) — j—0.01) — 
0.50 1.069 1.932 | —14.67 0 +0.01/—0.01} — 0 — 
0.90 1.186 |—10.55 5.014 0 +0.02/—0.02) — 0 — 
0.95 1.369 |—17.16 18.39 0 +0.01)—0.02} — i4+0.02; — 
0.975) 1.222 |—19.89 24.12 j—0.01/+0.01/—0.02; — |4+0.02) — 
0.98 1.007 |—19.78 24.69 |—0.01/+0.01;/—0.02; — [40.01; — 
0.99 0.679 -|—21.66 30.15 |—0.01| 0 —0.02; — |40.02) — 
|0.999 0.408 |—29.73 48.79 0 0 —0.02;} — |+0.02} — 
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As versus 1/YN 


ONE SIDE 

EACH SIDE 
EITHER SIDE 

EACH SIDE, ANY CUT 



































Fic. 7 — Differences between interpolated values of s computed from Tables 
a to XV, inclusive, and appropriate equations (23) to (26), inclusive, for P = 0.01 
and 0.99. 


the maximum value possible for P(s/s, any cut). Nevertheless, as we 
shall see below, it appears to predict empirically the large sample be- 
havior of runs above and below any cut even better than (23), (24), and 
(25) predict the large sample behavior of the other types of run. 

For this comparison, values of s corresponding to particular values of 
P were interpolated (in a few cases, extrapolated) from the exact deter- 
minations of Tables XII to XV. Since the distributions for each sample 
size in these tables had been found to be mildly deviant from log-normal, 
the interpolation process first obtained a three point log-normal rela- 
tionship in the P area of interest by changing the s-scale to an (s + a)- 
scale. Here, a is the constant that must be added to s to produce the log- 
normal relationship in the interval under consideration. Values of s for 
each P, n, and type of run were obtained to four decimal places. 
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In each case, the difference between the interpolated value and that 
given by the appropriate equation (23), (24), (25), or (26) was calcu- 
lated. At this point, it was found that some of these differences for a par- 
ticular P and type of run could be approximated by linear equations in 
1/n or 1/-/n. In view of this, all have been fitted by the equation: 


ies Ala Se. C 


Vn n ~/n3 
The constants, A, B, and C, have been recorded in Table XVI. The 
agreement between the values given by this equation and the differences 
on which they were based seldom exceed 0.02. Thus, it was assumed that 
(27) provided a reasonable approximation for extrapolation to the 
larger sample sizes for which values are shown in Tables I to IV and in 
Figs. 1 to 4. 
To illustrate the agreement with (27), some typical results for P’s of 
0.01 and 0.99 are given in Tig. 7. All show that the differences converge 
in a reasonably uniform manner to zero at infinity. 


(27) 
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Properties of Control Chart Zone Tests 


By S. W. ROBERTS 
(Manuscript received September 10, 1957) 


This paper 1s concerned with the statistical properties of tests com- 
posed of the standard control chart test supplemented by one or more 
tests for runs of points in various zones into which the control chart is 
partitioned. The basic properties of the resultant tests, called zone tests, are 
illustrated graphically. A procedure for determining the properties of many 
zone tests of practical interest is described. 


I. INTRODUCTION 
1.1 General 


In using an X control chart to maintain control of a process average, 
we periodically measure n units of the product and plot the average 
measurement X, on the control chart in its chronological position. The 
control chart presents a pictorial summary of production history that is 
useful in: (a) detecting changes in the process average, and (b) pro- 
viding clues to the causes of such changes. Various run tests have proved 
useful in application (b).’ Most of the literature on run theory pertains 
to this application. There are tests for runs up and for runs up and down; 
there are tests for the number of runs and for the lengths of runs. The 
control chart is particularly suitable for run tests. We shall consider the 
use of a particular type of run test in application (a). 

In application (a), as each point is plotted we decide whether or not 
to look for trouble (to take action to eliminate the cause of the change 
in the process average). Using the standard control chart test,” * we look 
for trouble if a point falls in a zone outside of two control limits sym- 
metrically placed on either side of a line representing the nominal proc- 
ess average. The control limits, called the 3 (8-sigma) limits, are placed 
at Xo’ - 3(0’/+/n), where X)’ and o’ are the nominal process average 
and standard deviation, respectively, and n is the sample size, or num- 
ber of units of product measured for each point. We shall assume that 
X,’ and o’ are known, and that oc’ remains fixed. 
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In using a statistical test to decide at each point whether or not to 
look for trouble, we are subject to two types of errors: 

(1) We make Type 1 errors when we decide to look for trouble when 
in fact none is present. 

(2) We make Type 2 errors when we decide not to look for trouble 

when trouble is actually present. 
Few Type 1 errors are made when the standard control chart test is 
used — an average of about one point in 370 falls outside of the 3c limits 
when the process average is at its nominal level. Type 2 errors occur at 
consecutive points following a change until the test used indicates that 
a change has occurred. Small changes may result in long sequences of 
Type 2 errors because the probability of a point falling outside of the 
30 limits may be small, though larger than it was when the process aver- 
age was at its nominal level. This definition of the two types of errors 
makes a sharp distinction between the presence and absence of trouble — 
a distinction more theoretical than practical — in order to simplify the 
exposition of the subject. 

Experience indicates that, in general, the standard control chart test 
maintains an economic balance between the two types of errors in a 
wide range of industrial applications (Reference 2, pp. 276-7; Reference 
3, p. 11). However, other tests may be more attractive economically in 
applications where early detection of relatively small changes is impor- 
tant. It has been suggested (Reference 4, p. 128) that supplementary 
run tests may prove useful in such applications. Various run tests are 
used in practice to supplement the standard control chart test,* but 
little has been published on the properties of the resultant tests, t though 
it is quite apparent that each additional supplementary run test em- 
ployed decreases the number of Type 2 errors made and increases the 
number of Type 1 errors. 

There are several alternative ways to reduce the number of Type 2 
errors made; we can: 

(1) Set the limit lines closer to the nominal process average X,’. 

(2) Increase the sample size.’’’ 

(3) Replace the standard test with a single test for runs of points 
outside of appropriate limits.® 

(4) Supplement the standard test with one or more run tests. 

(5) Temporarily modify the sampling procedure — e.g., increase the 
sample size or frequency of sampling — whenever a point falls outside 
of “warning” limits but inside of the “action” limits (30 limits).*’ 
~ * See footnote, page 89. 


+ After the page proofs of this paper had been received, the author was advised 
of Reference 9, which deals primarily with the test T12(Zi , Le). 
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(6) Use a control chart for a statistic other than X,, ; for example, plot 
points representing the moving average of k consecutive X,,’s. 

The improvements generally require extra information or more com- 
plicated tests, or they result in an increased frequency of Type 1 errors. 

In this paper we study the properties of various run tests that either 
replace or supplement the standard control chart test in application (a). 
We limit our study to a particular type of run tests which we call ‘‘zone 
tests’ because they test for runs of points in various zones into which 
the control chart is partitioned. For example, we study such tests as 
Ty» (8, 2),t which calls for action if a single point falls outside of the 3c 
limits or if two of three consecutive points fall outside of a 2c limit line. 

We limit our studies to tests used on charts of the statistic X, ; zone 
tests can be useful on other charts, but their properties depend on the 
properties of the particular statistic plotted. Our results apply for any 
sample size and frequency of sampling. 

We use 7,,(L,,) to denote a test for k consecutive points outside of one 
of the pair of limit linesat Xo’ + Li(o’/-Vn), and T;-(L;) to denote the 
test for k out of & + 1 consecutive points outside of the limit lines. If 
we combine two tests, we let 7:,:,(Z:,, Lx.) denote the combined test 
that calls for action on the occurrence of ezther type of run; k; and ky are 
integers less than nine, either primed or unprimed. 

For simplicity of notation we may eliminate the brackets on the test 
notation if the subscripts provide sufficient information. For this purpose, 
we adopt standard limits for certain runs. Thus we may use 7; rather 
than T,(3) to denote the standard control chart test. Also, we use the 2c 
limits, the lo limits, and X’ itself as standard for runs of lengths 2, 4, and 
8, respectively. Thus 7» means T'»/(8, 2), and 7's means 7',(0). We use 
an asterisk to denote one-sided tests — those with limit lines on only one 
side of Xo’. Test 7,* has a single limit line, at Xo + 3(0’/-VW/n). 


1.2 Process Model 


We use a process model in which the process average is X’ = Xo’ + A, 
where A is subject to change. A picture showing how A changes with time 
would show a series of rectangular pulses (positive or negative) of vari- 
ous heights, separated by periods with A = 0. The beginning of a pulse 
corresponds to the occurrence of an assignable cause of variation, and 
the height of the pulse is a function of the particular cause. The pulse 
ending corresponds to the elimination of the trouble. The distribution of 


t Read subscript as 1, 2’. 
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the lengths of the pulses depends on the test we use to detect changes; 
the test should be designed to keep the lengths reasonably short. 

The sample average, X,, is assumed to have a normal distribution 
defined by its expected value X’ and standard deviation o’//n. 

Whenever A = 0, the process average is at its nominal level, and we 
say the process isin State 1. Whenever A =~ 0, there is trouble present, 
and we say the process is in State 2. We assume that no additional 
changes occur while the process remains in State 2. 

At each point we look for certain runs that rarely occur in State 1. 
In the absence of such runs there is no indication that the process is not 
in State 1, and accordingly we do not look for trouble. We do not at- 
tempt to define the probability that the process is in State 1 at any 
point. In this model, we stop the process to look for trouble on the jirst 
occurrence of a run for which we are testing. When the process starts 
again it is assumed to be in State 1; consequently, the testing procedure 
ignores previous points. 

Relatively straight-forward mathematics can be used to describe the 
properties of certain tests acting within the framework of this process 
model. Alternative, and perhaps more realistic, assumptions can easily 
lead to much more complicated problems of description. In many cases 
the results obtained here can be used to describe qualitatively the prop- 
erties of tests applied to more complex processes. 


1.3 Measuring the Two Types of Decision Errors 


As each point is plotted on the control chart we decide either that the 
process is in State 1— in which case we leave it alone — or that it is 
in State 2 — in which case we look for trouble. We make a Type 1 error 
when we say that the process is in State 2 when actually it is in State 1; 
Type | errors initiate needless action. We make a Type 2 error when we 
say that the process is in State 1 when actually it is in State 2; Type 2 
errors fail to initiate needed action. We generally make a series of con- 
secutive errors of Type 2 before detecting the change in state. 

Let the random variable y denote the number of points plotted while 
the process remains in State 2. Then y — 1 consecutive errors of Type 2 
are made. Let L(y) denote the expected, or average, value of y; then 
E(y — 1) is the average length of a series of Type 2 errors. 

Ey) depends on A, the amount by which the process average changes; 
we sometimes note this dependence by writing H(y; A). L(y; A) is a 
monotonically decreasing function of the magnitude of A; that is, the 
larger the change, the smaller is Z(y). In other words, tests are more 
sensitive to large changes than to small changes. 
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Fig. 1 — E(y) versus A for 7',(Z;) for various limits. 


Fig. 1 shows curves of H(y) versus A for T;(Z,) for Ly = 2, 2.5, 3, and 
3.5. Note on the curve for 7,(8), for example, that H(y) = 15 at A = 
1.5 (o’/+/n); this means that following a change of this magnitude, an 
average of 15 points are plotted before a point falls outside of a 3¢ 
limit. Note that as A approaches zero, /(y) approaches 370, which cor- 
responds to the average number of points between consecutive Type 1 
errors while the process remains in State 1. 

In Fig. 1 and later figures the abscissa is A, and it is measured in units 
of o’/+/n, which is the standard deviation of X,. The particular ab- 
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scissa that applies to a change of a given physical magnitude is propor- 
tional to +/n; for example, if n is doubled in the above example where 
A = 1.5(c’//n), then the appropriate abscissa on Fig. 1 increases from 
1.5 units, with ordinate E(y) = 15 on curve 7), to 1.5+\/2 = 2.121 
units, with H(y) = 5.4. The positions of the curves relative to one an- 
other are independent of n. 

If the process were to remain in State 1 indefinitely, E(y; 0) would 
represent the average number of points between consecutive Type 1 
errors, and 1/[E(y; 0)] would be the asymptotic probability of a Type 1 
error. In comparing tests with respect to Type 1 errors, we compare 
their values of E(y; 0). 

In comparing tests with respect to Type 2 errors, we compare their 
values of E(y), or H(y — 1), for various non-zero values of A. 


1.4 Comparing the Statistical Properties of Various Zone Tests 


We are primarily interested in the distribution of y. The distribution 
of y for all of the zone tests we consider can be adequately summarized 
by one parameter — its average value H(y) (see Section 3.1). Therefore, 
in comparing the statistical properties of various tests, we compare their 
curves of E'(y) versus A. From such curves we can determine the asymp- 
totic probability of Type 1 errors, 1/[E(y; 0)], and the average number 
of consecutive Type 2 errors, E(y — 1; A), for any A different from zero. 

Figure 1 illustrates how the properties of zone tests can be changed by 
changing the limit lines. By changing the limit lines of 7,(Z;) from I, = 3 
to LZ, = 2, we reduce E(y) for all values of A: when A > 0, this means 
that the Type 2 errors are reduced; when A = 0, this means that 
Type 1 errors are increased. 

A choice between two tests should be based partially on the relative 
values of the two types of decision errors. We can fix the Type 1 errors 
at any desired level by an appropriate setting of the zone limits; then 
the Type 2 errors alone serve as a basis of comparison. 


II. SUMMARY OF RESULTS 


Section 4 shows how to determine the distribution of y, and in partic- 
ular its average value H(y), for one-sided tests 7;,*(Zx) and Ty*(L,), 
for any k. Simple substitutions into equations for the above one-sided 
tests allow us to determine the properties of any test of type Ti,*(Zi , Lx) 
or Ty*(,, Ly). We then determine the properties of two-sided tests 
from the properties of the corresponding one-sided tests. 

We show that the average values of y in any two separate tests provide 
upper and lower bounds to the average value of y in their combined test. 
Thus with subscript ¢, denoting test 7, , fe denoting 7: , and tt, de- 
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noting the test 7.,., combining 7, and T;, , we have upper bounds 
Buye(y; A) S Ei(y; A), Bunly; A) S E.(y; A), (1) 
and a lower bound 
ee ee ee ny eee 
Bni(y; 4) Byys A) Bin(y; A) 


An application of (2) to the determination of the properties of two-sided 
tests in terms of the properties of their component one-sided tests yields 


i 1 1 
EW; a) = B*y; a) 1 By; =A)’ 7 
where the asterisks denote one-sided test results. 

We can determine the properties of the following tests: 7;(Zz), 
Ty (Le), Tidy, Lx) and Ty(Ly, Lx), for any k. With L, = 3, the last 
two types of tests supplement the standard control chart test 7,(8) 
with one other zone test. 

Equations (1) support the logical conclusion that the more criteria 
we have to indicate the presence of trouble, the more quickly we will 
look for trouble when it is present as well as when it is not present. Thus, 
in supplementing the standard control chart test with other tests, we 
decrease the Type 2 errors at the expense of more frequent Type 1 er- 
rors. The question of how far to go in supplementing the standard control 
chart test must be answered in light of the relative importance of the two 
types of errors in the particular application considered. 

Section IIT presents a series of charts to show the properties of several 
particular tests, including 71, Tr, Ti, Tis, and Ty43 . The last test* 
illustrates the effect of supplementing 7 with more than one additional 
test; its properties were determined through the use of Monte Carlo 
techniques. We also show E(y) versus A for several tests when their zone 
limits are translated away from the center line so that their Type 1 
errors are comparable to those of 7’, . It is through such translations of 
zone limits that we can offset the undesirable effect on Type 1 errors 
that occurs when we add new tests to our testing procedure. 


(2) 


III. CHARTS SHOWING PROPERTIES OF VARIOUS ZONE TESTS 


3.1 Distribution Function of y 


The cumulative distribution function of the random variable y, 
Q; = Prob (y > J), is shown in Figs. 2 and 3 for various zone tests. 


* This test is similar to one that has been used by the Western Electric Company 
in its quality control training program; somewhat different criteria for taking 
action are used and therefore the statistical properties differ. 
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The curves are applicable only at integral values of 7. If y > 7, there 
have been no indications of a changed process average in the first 7 
points following the change from Xo’ to Xo’ + A. 

Vig. 2 shows curves for 71, 7, Ts, and 73 for A = 0, o’/+/n, and 
2(o'/~/n). Fig. 3 compares 7, Ty , Ti , and Ty4s3 for A = o’/+/n; 
it illustrates the effect of additional tests on the distribution of y. 
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Fig. 2— Cumulative distribution of yfor 71 , 7's’ , Tv , and Tsfor A = 0, o’/-Vn, 
and 2(0'/~/n). 
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Fig. 3 — Cumulative distribution of yfor 71, 7’'o’, Tio’ and Ti0's’s for A = o//V/n. 


The curves of Figs. 2 and 3, plotted on semilogarithmic paper, can 
be approximated for practical purposes by straight lines. Thus, the dis- 
tributions are approximately geometric, or discrete exponential, dis- 
tributions that can be described by a single parameter E(y) and an 
initial value. It is for this reason that /(y) adequately summarizes their 
statistical properties. 
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Figs. 2 and 8 illustrate the fact that single tests for long runs, such as 
Ts; , do not become fully effective immediately following a change. 


3.2 Curves of E(y) Versus A for Tests with Standard Zone Limits 


Figs. 4, 5, and 6 illustrate typical curves of H(y) versus A. Fig. 4 
shows curves for 7, , Ty , Ty , and 7's. Fig. 5 shows the effect of broad- 
ening the criteria for looking for trouble — T. calls for action only if 
two consecutive points fall outside of a 20 limit, whereas 7. calls for 
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Fig. 4 — E(y) versus A for T;, To’, Ts’, and 73. 
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Fig. 5 — E(y) versus A for 71, T'2, Ts’, Ti. , and Ti’ . 


action whenever 7, does and also whenever two points falling outside a 
2o limit are separated by a single point not falling outside of the 20 limit. 
E(y) is less for Tx than for 7, for all values of A; this difference is re- 


flected in the curves for 7. and Ty , which supplement 7’; with 7, and 
Ts , respectively. 
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Fig. 6 — E(y) versus A for 71, Ts , Tis , and T12'4’s . 


Tig. 6 illustrates the effect of supplementing 7 first with 7’s and then 
with 7s, Te, and Ty. Notice how the Type 1 errors become more 
frequent as Type 2 errors decrease. 


3.3 Curves of E(y) Versus A with Limits Set for a Selected Probability 
of Type 1 Errors 


Fig. 7 shows curves of £(y) versus A for tests for k (k = 1, 2, 3, 4, 6, 8) 
consecutive points outside of limits that are set for each k so that the 
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probability of a Type 1 error is comparable to that of 7, . Tests for long 
runs clearly are most effective against small process changes, while 7, 
itself is most effective against large process changes. 

Tig. 8 shows curves for T,, 7's(0.065), and 718(3.19, 0.19). The last 
test is composed of the first two tests with all zone limits translated away 
from Xo’. Notice that 7, and 1's(0.065) taken individually are more 
effective than 7(3.19, 0.19) in certain ranges of A. Fig. 9 illustrates 
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Fig. 7 — E(y) versus A for T;,(Lx) for k = 1, 2, 3, 4, 6, and 8 with limits set for 
the same probabilities of Type 1 errors. 


96 


400 





















ARCOCCCEEeeee 
val ee ese es eal 
NO De ailh olla tien a edt led 
“CNKECEET Ee 
ee ee 
is nba Vee es ee Ol 
5 a Ss coy ae te 
: CoN ere T rr 
(SENSE 
ew! | | TAIN | tT tT tT tt 
Cope AS 
aS ee 
ee eee 
ya Ga a i a as Da 
Ce 0 ee eS 
+44 it tt Ss i 
oe ee 
POeCece ee 


A IN UNITS oF —Z 
vn 


THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1958 




















3.0 3.5 


Fig. 8 — E(y) versus A for 71 , T's(Ls), and Tis(Li , Ls) with limits set for the 


same probabilities of Type 1 errors. 
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: Fig. t= E(y) versus A for 7; ; T2'(Le'), To’ y ; D2‘), and Ty2'4’3( Ly , Da! . Ly! ; Ls) 
with limits set for the same probabilities of Type 1 errors. 
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the same general ideas as Fig. 8, with the addition of Ty2-43 with its zone 
limits translated away from Xo’. 

Because logarithmic scales are used for E(y), the differences 
Ey\(y) — Ey) between curves for 7, and other tests are distorted; 
Fig. 10 shows the difference on an arithmetic scale for two of the curves 
of Fig. 9. 

Fig. 11 supports the theory that 7',,(Z,-) is slightly more sensitive 
to small changes than 7',(Z;,) when the limits are set so that the two 
tests have the same probabilities of Type 1 errors. Further graphical 
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Fig. 10 — The difference between ordinates of curves of Fig. 9 shown on an 
arithmetic scale. 


PROPERTIES OF CONTROL CHART ZONE TESTS 99 


400 


300 





200 





150 





100 
90 
80 
70 


60 
50 








40 





30 








E (y) 
















































































0.5 1.0 ; 
ao! 
A IN UNITS OF —= 


Vn 


Fig. 11 — E(y) versus A for 74(Z,4) and 74'(Z,’) with limits set for the same 
probabilities of Type 1 errors. 


support is given by curves for 7'(1.93) of Fig. 9 and T2(1.78) of Fig. 7. 
No analytical proof has been developed. 


IV. DETERMINING THE STATISTICAL PROPERTIES OF ZONE TESTS 


4.1 General Procedure 


With the control chart partitioned into mutually exclusive zones A, 
B, C, D, +++ , R, we represent a sequence of points falling consecutively 
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into zones B, C, D, and B, for example, by the sequence bedb. The lower 
case letters such as b serve a dual purpose — they denote the fact that a 
point falls intoa particular zone, and they denote the probability of that 
particular event, or outcome. l’or example, the probability of a particular 
sequence bcdbedb is b’c’d’. Where there is danger of confusion, we may 
denote outcome b by e, and its probability by p,. A sequence bcdb is 
considered to represent the outcome of a sequence of independent trials, 
each of which has fixed probabilities of outcomes a,b,c, --- , 7. 

Since the control chart points represent an average measurement X, 
that has a normal distribution with average Xo’ + A and standard de- 
viation o’/+/n, we use normal probability tables to determine the proba- 
bility b, which remains constant from point to point as long as the proc- 
ess remains in a given state. If (7) is the area under the normal curve 
above x, and if zone B is between limit lines at Xo’ + Le(o’/+/n) and 
Xo + Lilo’/-Vn), where L. < L,, then probability 


b = (1: = avin) _ @ (1: = ayn) 








a’ go’ 

When the process changes from State 1 to State 2, the probabilities 
of points falling into the various zones change. At the first point in State 
2, zone tests see one point from State 2 preceded by a sequence of points 
from State 1; at each subsequent point in State 2 a single point from 
State 1 is dropped from consideration, until at last all points considered 
are from State 2. The zone tests are such that the probability of a point 
from State 2 falling into a critical zone is greater than the probability of 
a point from State 1 falling into the same zone. Consequently, the proba- 
bility of the occurrence of a run of points in a critical zone is greatest if 
all of the points are from State 2. For simplicity and clarity we neglect 
points from State 1 while considering the results of testing points 
from State 2. This means that 7’; , for example, does not become effec- 
tive until the eighth point in State 2 appears. This simplifying assump- 
tion will affect the results little; its effect can be eliminated by calculat- 
ing the probability of detecting the change in the first few points and 
adjusting our results. As an illustration, 7'3(0.065) of Fig. 7 should ap- 
proach 7.1, rather than 8, as A approaches infinity. 

If a control chart is partitioned into three mutually exclusive zones 
A, B, and C, outcomes a, b, and ¢ are associated with the events that 
points fall in the respective zones, and probabilities a, b, and c 
(a + b +c = 1) are the corresponding probabilities of the events, or 
outcomes. The possible outcome of the first 7 trials, or points, can be 
enumerated by the ordered expansion of the multinomial (a + b + c)’. 
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For example, with 7 = 2, we have: 
(a+b+c) =aa+ab+ac + bat+ bb + be + ca + cb + ce. 


The probabilities of the various sequences occurring are obtained simply 
by multiplying the individual terms; for example, sequence aa has prob- 
ability a’. The probability of a particular event such as the event that 
either a or b occurs at least once in the first two trials is determined by 
selecting those sequences in which this event occurs and cumulating 
their probabilities; in this case it is a” + b° + 2ab + 2ac + 2be. 

If we wished to determine the probability Q; of no occurrences in the 
first 7 trials of an event ¢ (a run of eight consecutive points in zone A, 
for example), we could enumerate all of the 3’ possible outcomes, pick 
out those we were interested in, and determine their probabilities. This 
procedure becomes very tedious as 7 increases, and we soon look for 
shortcuts. We attempt to find a recursion equation defining Q; in terms 
of a limited number of terms Q;-1, Qj-2, etc. If we can find such an 
equation, we need to enumerate all pertinent outcomes only to the point 
where the equation becomes effective. 

A recursion equation for Q; , together with a set of initial conditions, 
leads to a generating function Q(s) whose power series expansion ex- 
hibits Q; as the coefficient of s’: 


Q(s) = 1+ Qs + Qu? + ++ + Qs’ f+ = Do Qis’. (A) 


The generating function is useful in obtaining moments of the distribu- 
tion of y. In particular, we obtain H'(y) by setting s = 1 in the equation 
for Q(s): Ey) = QC). 

The simplest zone tests are those in which a point is classified in one 
of two categories; it represents either event «, with probability p or 
event e, with probability gq = 1 — p. We arbitrarily call «, a success 
and e, a failure.* We call a test for success runs a simple run test. A 
compound run test is composed of more than one simple run test; for 
example, a test for a run of two consecutive points above the +2c limit 
is a simple run test, but a test for a run of two consecutive points above 
the +2o limit or below the —2c limit is a compound run test composed 
of two simple run tests. A simple run test classifies points in two ways; 
a compound run test classifies points in more than two ways. 

The test for a run of two consecutive points above the +2c limit is a 
one-sided zone test; the test for a run of two consecutive points above 


* This terminology may seem incongruous, since we hope for events e, , which 
we term failures. Alternatively, we could change the definition, and say that we 
test for failure runs, but this conflicts with standard terminology. 
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the +20 limit or below the —2c limit is a two-sided zone test. We derive 
the properties of two-sided tests from those of one-sided tests. 

In Sections 4.2 and 4.3 we present recursion equations and generating 
functions for Q;, the probability that y > j, for the following simple 
run tests: 

(1) & consecutive successes b= 1,9,37 4, 408% 

(2) k successes in k + 1 (or k) consecutive trials k = 2, 3, and 4. 
In addition, we describe a procedure for extending & in (2) to any value. 
Equations for E(y) are also presented. The results apply to one-sided 
zone tests. 

Section 4.4 describes a procedure for determining the properties of 
two-sided zone tests from the properties of one-sided zone tests. 

Section 4.5 presents a procedure for determining the properties of any 
run test combined with a test for a single point in a critical zone. Simple 
substitutions into the equations for a particular one-sided zone test lead 
to a description of the properties of that test in combination with the 
standard control chart test 7':*. 

Section 4.6 develops upper and lower bounds to L(y). Section 4.7 
shows how to determine easily the properties of some tests whose zone 
limits are non-standard. Section 4.8 discusses the use of Monte Carlo 
techniques for determining the properties of tests more complex than 
those considered here. 


4.2 The First Occurrence of k Consecutive Successes 


We separate those sequences of outcomes having no occurrences of k 
consecutive successes in the first 7 trials (that is, y > 7) into mutually 
exclusive categories according to whether the last failure occurred on 
trial 7,7 — 1,7 — 2,--- orj —k + 1. With Q; denoting the probability 
that y > 7, we let Q;,; denote the probability that y > 7 and that trial 
7 — tresulted in a failure and the succeeding 7 trials resulted in successes. 
Then, since 7 can be no greater than k — 1, we have the equation: 


Q; = Qio + Qin + Qo Oye. (5) 
We enumerate the possible results: 


Sequence 


Endings Probabilities of Occurrence 
te eerenes q Qi0 = Q(Qi-1.0 + Qian + ove) + Qj-1,2-1) 
-Qp QQ. = pq(Qj-2.0 + Qy-2 + e+ + + Qi-2,4-1) 
tetiasel qpp Qi.2 = p?q(Qj-3.0 + Qi-31 + +++ + Qj-3,%-1) 
sees qppp pa = pq(Qj-4,0 + Oya + oo H+ Qy-4,4-1) 


qpp :*: p Qj.n-1 = pP'g(Qi-n.0 + Qik + ove + Q5-x,e-1) 
ee~_--——" 
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The equations on the right reduce to Q;,; = p’qQj-i-1. We obtain the 
desired recursion equation by summing over all values of 2, 

Qs = qQia + pqQs-2 + p'aQi3 + + +p "qQi-+. (6) 


We can use (6) to calculate Q; for 7 2 k, noting that Q; = 1 for7 < k. 
We obtain the generating function of Q; from (6), 


Q(s) = a Se (7) 
1 — st qpkstt” 
Then E(y) is obtained by setting s = 1 in (7), 
| a k 
Ey) = —2.. (8) 





qp* 
These results are well known.® 


4.3 The First Occurrence of k Successes in k + 1 Consecutive Trials 


As in the preceding section, we separate those sequences having no 
occurrence of the event in question — in this case k successes in k + 1 
consecutive trials —into mutually exclusive categories according to 
whether the last failure occurred on trial 7,7 — 1,7 — 2,---,orj7 —kK+1. 
In the current problem, however, we are also interested in the location . 
of the next-to-the-last failure since if the event in question has not oc- 
curred there must be at least two failures in the preceding k + 1 trials. 
If the last failure was on trial 7 — (k — 2), for example, there must be 
at least one other failure in the preceding two trials. Here an enumera- 
tion of possible results yields: 


ean Probabilities of Occurrence 

rote eeeees GQi.0 = G(Qj-1,0 + Qj-ra Hote + OQj-t.e—2 + Qj-a.z-1) (9.0) 

eee eeees gp Qin = pq(Q;-2,0 + Qj-211 os + Qj-2,4-2) (9.1) 

vane noes app Q5.2 = p?q(Qj-3,0 + ae + Qj-3,%-3) (9.2) 

gp +: ppp Qjn-2 = pF *q(Qi a0 + Qi-a-n,1) (9.(k — 2)) 

qpp -:: ppp Q;,n-1 = p*'q(Q;-«.0). (9.(k — 1)) 
k—-—1p’s 


Each equation in (9) has one term less than the equation immediately 
above it. We adopt a standard procedure for deriving a recursion equa- 
tion for Q; from equations (9). First we find from (9.0) that: 


Qi0 = Qiu. (10) 
Then we substitute (10), with j reduced by k, into (9.(k — 1)): 
Qixa =p TQia4. (11) 
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Next we translate the final term on the right-hand side of (9.0) to the 
left-hand side, and substitute (10) and (11), the latter with 7 reduced 
by one. Then, if we multiply through the new (9.0) by p and reduce j 
by one, its right-hand side is identical to that of (9.1). Then we have 


Qia = pgQi2 — p'G Qj. (12) 


We substitute (10) and (12), with 7 reduced by (k — 1), into (9.(k — 2)) 
to obtain 


Qir2 = PG Qi + DY Qi — PG Qi - (13) 


We proceed step by step, taking equations from the top and then from 
the bottom, to find equations for the Q,,,’s in terms of Q,’s. Then we 
add all of the equations together to obtain the recursion equation for 
QQ; , which will depend on some of the k(k + 1)/2 immediately preceding 
Q;’s. The recursion equation is used with k(k + 1)/2 initial Q;’s to de- 
rive the generating function Q(s). 


4.31 The First Occurrence of Two Successes in Three Consecutive Trials 


As in (9), we have: 


Sequence Endings Probabilities of Occurrence 
“q Qi.0 = G(Qi-1,0 + Qj-1,1) (14.0) 
qp Qin = pq(Q;-2,0). (14.1) 
Then Qo = qQj;-1, Qja = p¢'Q;-s, and the recursion equation is 
Qi = Qian + pe Qi-s, j>2. (15) 


With (15) and the initial conditions Q@ = Q: = land Q, = 1 — p’, 
we derive the generating function for Q;: 


_ 1+ ps + pas’ 
Q) = _ LR. (16) 
E(y) is obtained by setting s = 1 in (16); 
1+ p+ pq 
EG) = 2 Ss 17 
Y= BOF “ 


4.32 The First Occurrence of Three Successes in Four Consecutive Trials 


The initial conditions are: 


QO = =Q = 1, 
Q=1->p’, 

Qs = 1-—p — 3p, 

Qs = 1— p — 3p'q — 3pid’. 
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Tor j > 5 we follow the standard procedure to find a recursion equation 
for Q; in terms of the 3-4/2 = 6 preceding Q,’s: 
Qs = Qi + pqQi2 + PQi-4 — DG Qi-s, j> 5. (18) 
The generating function is 


1+ ps + p's’ + pgs’ — p'qs' — p'q's® (19) 


Ss bod 
Q(s) 1 — qs — pqs? — pst + pigs? ? 





and the expected value of y is 


LP pap eee 
p a 
Ey) ae (20) 


4.33 The First Occurrence of Four Successes in Five Consecutive Trials 
Here the 4-5/2 = 10 initial Q,’s are: 

OM=A=e2=a=1 QG=l-p, 

Qs = Q — 4p'g, 

Qs = Qs — 4p'¢’, 

Q: = Qs — 4p’ — 3p'¢’, 

Qs = Q: — 4p'q' — Tp’? — 2p'¢’, 

Qo = Qs — 4p'g? — Lp’g’ — 9p"y’ — p'd’. 
For j > 9 the following recursion equation holds: 


Q; = qQjat pqQi-2 + p'PQs-+ + 2p°¢Qi- 


(21) 
—p¢Qi-a — p'YQj-20. 
The generating function of Q; is 
1+ ps + p's’ + p’s® + 2p'gs" ~ pas’ 

= pqs = pas aa pgs 
= 2 
ee tre qs — pgs" — piqis' — 2piqis' re or 

+ n'Y sit py 10 


Then 
1+ p+ 2p" + 2p'q — pg — pig — Pg — Ve (23) 


EB = 
@ PA aS 20 + 2P + Pg) 


4.4 Properties of Two-Sided Zone Tests 


The results presented in Sections 4.2 and 4.8 are applicable to the 
study of the statistical properties of one-sided zone tests for runs of 
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points in the zone above an upper limit line at Xo’ + L.(o’/+/n). Gen- 
erally, we also test for the same types of runs below a lower limit line 
at Xo’ — Li(o’/+~/n), in which case the test is a two-sided zone test and 
each point falls into one of three mutually exclusive zones. 

Let A denote the zone above the upper limit, B denote the zone be- 
tween the two limits, and C denote the zone below the lower limit. Con- 
sider an infinite sequence of independent trials having possible outcomes 
a, b, and c with fixed probabilities a, b, and c. When the outcome of the 
jth trial completes a pattern of outcomes describing an event « we say 
that « occurs on the jth trial. Event ¢ is defined by a set of outcome pat- 
terns and a counting, or testing, rule. If when e occurs on the jth trial 
we treat trial 7 + 1 as though it were the first trial, ignoring the results 
of the first j trials, then ¢ is a recurrent event. 

Let 
Uj = Probability that ¢ occurs on the jth trial, 
fr, = «  « “for the first time on the jth trial, 

Q; = Probability that ¢ does not occur in the first 7 trials. 


Denote the generating functions of u;, f;, and Q; by U(s), F(s), and 
Q(s), respectively. 

The following equation can be used to determine the Q’s in terms of 
the f’s: 


Q(s) = 


If € is a recurrent event the following equation holds [Reference 8, 
p. 243]: 


1a rice <d Of 


‘ 


= fy + fyi + fy-eue + +++ + fuji. (25) 
Equation (25) leads to the following identity (setting fo = 0, uw = 1): 


if 


U(s) = Ta Fo 


(26) 
From (24) and (26) we have 


(1 — s) U(s) = ay =l<g<1, (27) 


for recurrent event e. We shall consider only recurrent events which have 
finite recurrence times; in these cases F(1) = fi + fe + -:- = 1, and 
U(1) is infinite. The limit of (1 — s)U(s) as s approaches unity from 
below is (using L’Hospital’s Rule): 

1 1 1 


lim (1 — s) U(s) = Fd) = = Od) ~ = Foy (28) 
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where y denotes the number of the trial of the first occurrence of e«, and 
E'(y) denotes its expected value. F(y) is also the average recurrence time 
(average number of trials between consecutive occurrences) of recurrent 
event «. 

Consider recurrent events «1, €2, and «2, defined, respectively, by the 
sets of outcome patterns a, 8, and a or 8 and a counting rule that re- 
quires counting to start from scratch on trial j(j7 > 1) if and only if the 
event under consideration occurs on trial 7 — 1. Assume that e and e 
are mutually exclusive — that is, they cannot both occur on the same 
trial. 

For an example, let the single pattern a c a define the set a and the 
pattern c ac define the set 6 — then the set a or 8 has the two patterns 
acaandcac. Consider an outcome sequence: 
trial number: 123 45 67 8 9 
trial outcome: @acaeaeaoba 
The event e occurs on trials 3 and 7; the event e, occurs on trial 4; and 
the event e2 occurs on trials 3, 6, and 9. 

Let F,(y), Holy), and Ey.(y) denote the average recurrence times of 
é1, €2, and ee, respectively. Under what conditions can we determine 
Ey(y) from known values of E,(y) and H2(y)? 

Consider events 4” and e¢,” defined by outcome patterns a and 8, re- 
spectively, and a counting rule that requires counting to start from 
scratch on trial 7 if and only if either ev” or e” occurs on trial 7 — 1. 
Events ey” and e,” differ from e, and e only in counting rules. In the 
example previously considered, we see that e” occurred on trials 3 and 
9, and e” occurred on trial 6. Either e,” or e” (but not both) occurs on 
every trial on which ei. occurs; this leads to the equation 


Me; = W,,;” + Ue2,;”, (29) 


where Ui2,;, U1, ;” , and we, ;” denote, respectively, the probabilities that 
ew, €.’, and e” occur on trial j. 

Multiplying (29) through by s’? and summing over j from one to in- 
finity, we obtain an equation relating the generating functions of the 
probabilities in (29): 


Uy2(s) = U,"(s) + U2" (s) — i. (30) 


The constant appears because up = 1 in all cases. 

Events e” and e&” are recurrent events, and equations (25) through 
(28) can be used to determine their mean recurrence times /,”(y) and 
Eo" (y). (The fact that (25) applies allows us to call ey” and e” recur- 
rent events). 
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If we multiply through (30) by (1 — s) and take the limit of each side 
as s approaches unity (see (28)), we obtain 


1 = 1 Fe 1 
Exe(y) Ey" (y) Tis!” (y) 


In any sequence of trial outcomes, if a pattern in a occurs for the first 
time on trial j, then « occurs for the first time on trial 7, and e” occurs 
for the first time either on trial 7 or on a later trial; «” will occur for the 
first time on a later trial if €” occurred while this first pattern in @ was 


being formed. Thus we have 
Ex(y) < Ey’ (y) (32) 


where the equality sign holds if and only if no pattern in 8 overlaps a 
pattern in a. A pattern in @ overlaps a pattern in @ if the terminating 
outcomes of the former correspond to the beginning outcomes of the 
latter. Thus outcome pattern c ac overlaps ac a because the terminating 
outcomes ac of the former correspond to the beginning outcomes ac of 
the latter. If no pattern in 8 overlaps a pattern in a then the occurrence 
e.” does not “cancel out” the beginning of any patterns in a, and there- 
fore e, and e,” always occur on the same trials. 
From (31) and (82) we have 


ao Sata 
Ex2(y) Fiy(y) Tis(y) 


where the equality sign holds if and only if e: and e2 are defined by non- 
overlapping patterns, in which case we shall say that e: and e2 are non- 
overlapping events. From our example it is clear that mutually exclusive 
events are not necessarily non-overlapping. 

We can use (83) to find Z(y) for two-sided tests in terms of the E*(y)’s 
of the component one-sided tests. Note that a given change A looks like 
a —A to one of the component tests. Then 


1 < 1 if i 
Ey; A) ~ E*(y;A)  E*(y; —A) 


For A = 0, E(y; 0) S (£*(y; 0)/2). The equality sign holds in (34) for 
T,(L,) and Ty.(21, L.). For Ty-(Lx) and T.(Ly1, Ly), (34) defines lower 
bounds which are very close approximations to H(y). For Ts equation 
(34) leads to a lower bound of 510.6, which compares with the true 
value li(y; 0) = 510.7. The degree to which the approximation ap- 
proaches the true value depends on the probability of overlap, which in 
cases we consider is very small; for this reason we can consider (34) to 
be an approximation rather than a lower bound. 


(31) 





(33) 


(34) 
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4.5 Properties of Tests Combining T,(I1) with One Other Test Whose 
Properties Are Known 


Consider any test 7' for which we partition the control chart into mu- 
tually exclusive zones A, B, C’, ---, R. The possible outcomes of the first 
7 trials can be enumerated by an ordered expansion of 


atb+et--: +7)’ 


With the letters denoting the probabilities of points falling into the 
various zones (a + b +e + --- +r = 1), we pick out all of those 
terms corresponding to outcomes in which the event ¢ does not occur, 
and denote their sum by 


Q1,; = gi(a, b, Cys r). (35) 


Clearly Q:,;, or g;, is the sum of a series of terms such as abc’ --- , 
representing the probabilities of particular outcomes. 

If we wish to find the probability Q1:,; of no occurrences of event ¢ and 
no occurrence of a point falling in zones A or B, say, we simply eliminate 
from g; those terms in which either a or b occurs. We can do this by sub- 
stituting zeros for a and b wherever they occur in g; : Qit,; = g;(0, 0, c, 
d, ---,7r). By multiplying and dividing each remaining term in g;(0, 0, c, 
d,--+,r) by (1 — a — b)’, we derive an alternative expression: 


c d 
Qing = Gi (0, 0, To, Cae 


(36) 
. pot) 0 ee b)’, 


"1—a 


showing that the conditional probability of no ¢ given no points in A 
or B uses the same function required for Q,,;. This enables us to write 
the generating function of Qi:,; as 


Cc d 


ule) = (0,0, 25, 


(37) 
a 
cS eet (ata is) 
where h(a, b, c, --- , r; s), defined fora + b-+c+ --- +r = 1, is the 


generating function of Q:,; . 

The principles are best illustrated by an example. Consider the prob- 
lem of finding /,*(y), the expected number of the trial of the first occur- 
rence of & consecutive points above Xo’ + Ly(o’/+/n) or a single point 
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above Xo’ + Ln(o’/+/n), where L;, < L,. We use an asterisk to denote 
the fact that a function applies to a one-sided test. We let a be the 
probability of a point falling above both limits, b be the probability of 
falling between the two limits, and c be the probability of falling below 
both limits. Then we substitute a + b for p, and c for q in (7) to obtain 


1— (a + b)*s* 

Riis) ost a 
Qu*(s) 1—s+c(a + b)tstt” 
Following (37), we find Q1,*(s) by substituting 0 for a, b/(1 — a) for 
b, c/(1 — a) for c, and (1 — a)s = (b + c)s for s in (89), 


Lays 


(39) 


* QS jliemaienest 
Qu*(s) [Os hoe (40) 
We set s = 1 in (40) to obtain 
¢ 1—v° 
Ey, (s) — (41) 


1— G+ 0) + oF 
The properties of any test combining 7'(Z,) with one other test whose 
properties are known can be determined in‘a similar way. 
4.6 Limits of E(y) in Compound Tests 
A development similar to that in Section 4.4 will show, for example, that 
1 1 1 1 


a a 
Eys(y) ~ Exly) — Ealy) — Es(y)’ 
where /i23(y) pertains to recurrent event €23 , whose set of outcome pat- 
terns is composed of those of recurrent events «, «, and ¢;. 
It can also be shown that 


Eys(y) S Evly) S Fly) (43) 


for example. Clearly we cannot increase the recurrence time of an event 
by increasing the different outcome patterns which define the event. 


(42) 


4.7 Translating Limits to Obtain a Selected Probability of Type 1 Error 


By supplementing 7; with other tests, we increase the probability of 
Type 1 errors. We can adjust the probability of Type 1 errors to any 
desired level by resetting the zone limits. With more than one set of limit 
lines, we have some freedom in setting the limits. A procedure that has 
the important attribute of simplicity translates all of the limit lines away 
from the central line Xo by the same amount. The properties of the re- 
sultant test can be derived directly from the properties of the original 
test. 

We first determine the properties of one-sided tests whose limits are 
translated; from these results we determine the properties of the cor- 
responding two-sided test. 
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If Qi*(Zi,, Lx,; A; 8) is the generating function of Q; for test i 
with limits at Xo’ + Ly,(o'/-Vn) and Xo’ + Ly,(o’/+/n) and with the 
process average X’ = X,’ + A, then (neglecting points from State 1): 


Q:*(Zn,, Lig} AS s) = Q.* (1, oP h, Lz 7 h; Bo h Tai 3) . (44) 


This equation says that the probabilities involved are identical if we 
translate the limits by a given amount or if we translate the process 
average in the opposite direction by the same amount. The truth of this 
stems from the fact that the probabilities depend on the position of the 
process average X’ relative to the zone limits. 

If we wish to set the limits so that the probability of a Type 1 error 
is sto, Say, for a two-sided test, we can proceed as follows: 

(1) draw the curve of L(y) versus A for the corresponding one-sided 
test (the abscissa is assumed to be in units of o’/+/n), 

(2) translate this curve to the right (or left) until H(y; 0) = 1000, 

(3) measure the amount h of the translation, and translate the zone 
limits away from (or toward) Xo’ by an amount h(o’/+/n) (control 
chart units). 
The translated E(y) versus A curve represents the new one-sided test. 
The curve for the corresponding two-sided test can be derived using 
(84); it will have a value E(y; 0) = 500. 


4.8 Monte Carlo Techniques to Determine the Properties of Zone Tests 


We can determine approximately the properties of zone tests by using 
Monte Carlo techniques on modern high-speed computers. First we 
generate a random series of numbers with a known distribution. Then, 
using the appropriate correspondence between limits within the distri- 
bution and zone limits, we translate the random numbers into a random 
sequence of zone designations, which we test for occurrences of the events 
in question. We keep score of the number of points until the event finally © 
occurs. We then start counting again as though the sequence were Just 
starting. By running through a great many cycles, we obtain an approxi- 
mation to the distribution of the cycle length y, and an approximation 
to E(y) for the particular value of A that applies to the limits we used. 
We repeat the process with different limits for different values of A. 

Within the limitations of the computer, this technique can be used 
for any zone test. We used it to approximate the properties of Ti2-4s 


for A = 0, 0 /~/n, 2(o’/+/n), and 3(0'/+/n). 
V. CONCLUSIONS 


If we supplement the standard control chart test with another zone 
test, we increase its sensitivity to process changes at the cost of more 
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frequent errors of Type 1 and a more complicated testing procedure; 
see Figs. 5 and 6. We can restore the original probability of Type 1 
errors by changing the zone limits; in the following discussion we shall 
assume that this has been done, thereby simplifying the comparison of 
various tests. We shall say that a zone test 7’, of the type we are con- 
cerned with is better than 7; for a particular value of A if Hi(y) < Ey(y) 
for that A. 

In general, the curve of Ey) versus A for a test T; is below the cor- 
responding curve for 7; for A in a range 0 < A < A,, and above for 
A > A,. The crossover point A; in the cases we considered varied from 
1.7 (0’/+/n) for T3(0.065) (Fig. 7) to over 3.5 (0’/+/n) for T12/(3.13, 2.13) 
(Fig. 9). 

Consider a test 7), that combines 7; and T; and has its zone limits set 
so that its probability of Type 1 errors is the same as for 7 and fo 
T'.. In the cases we have considered (see Figs. 8 and 9) 71, essentially 
effects a compromise between 7’, and 7’, — for small changes it is better 
than 7, but not as good as 7; ; for large changes it is better than 7’, 
but not as good as 7; for A near A; it is better than 7, and better 
than T;. 

In the cases we have considered, tests 7'(Z,-) appear to be slightly 
better than tests 7;,(Zx) for small changes. 

The reason that zone test 7; is better than 7’, for small changes seems 
to be due to the fact that it bases its decisions on a history of k consecu- 
tive points; in effect, it makes some use of a sample size kn rather than 
n. The cost of the increased effective sample size is paid during the first 
k — 1 points in State 2, where 7; has a higher probability than 7; of 
detecting a change. The probability that a point falls outside of a 3c 
limit remains fixed from sample to sample, and after the initial k — 1 
points in State 2, this probability is less than the probability that 7, 
will detect a run. Large changes are likely to be detected by 71 before 
T, becomes effective; but when changes are small the corresponding 
values of E(y) are large, and we can expect 7’, to detect the change be- 
fore 7; does (see Fig. 7). 

We have assumed that sample averages X, are plotted on the control 
chart. In light of the above discussion the possibility of pooling data 
from *& consecutive samples and plotting a statistic based on the kn 
measurements involved appears promising. 

A preliminary study of zone tests on charts of moving averages of k | 
consecutive equal-sized samples has been made. The statistic (or point) 


Veni = Cs + Xnjt + es + Xe / ke 


can easily be determined graphically in many cases. Ior example, the 


point Yo,,; is halfway between points X,,;. and X,,; on the straight 
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line connecting them — vertical rulings on cross-section paper ordinarily 
used will spot points exactly. Points Y4,,; can be similarly derived from 
points Yon j;-2 and Y»,,;. Fig. 12 shows curves of H(y) versus A for 7, 
used on points Yun j(k = 1, 2, 4); limit lines were assumed to be at 
Xo’ + 3(0’/~/kn). The curve for k = 1 is, of course, the curve 7; of 
earlier figures; the curve for k = 2 was derived using tables of the bi- 
variate normal distribution; the curve for k = 4 is an approximation 
based primarily on the results of a study making use of Monte Carlo 
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Fig. 12 — E(y) versus A for 7; applied to moving averages of k(k = 1, 2, 4) 
consecutive sample averages. — 
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techniques. We cover the possibility that the first point in State 2 will 
fall outside of its control limits by assuming the existence of prior points 
in State 1; all three curves approach H(y) = 1 as A approaches infinity. 

In comparing Fig. 12 with Figs. 7, 8, and 9, it appears that 7, used 
on moving averages provides an effective test for detecting shifts in 
process averages. Further study is required to determine the effective- 
ness of other run tests and of combinations of run tests applied to various 
moving averages. 

In summary, it is possible to devise zone tests which — within the 
constraints of our model: 

(1) indicate changes in process averages when none has occurred with 
the same average frequency as the standard control chart test 71, 

(2) detect small changes in process average— up to 1.30’, say, for 
n = 5-—sooner on the average than 7’, , and 

(3) detect larger changes inappreciably later on the average than 7, . 
Such tests require an appropriate setting of zone limits — generally at 
non-integral multiples of o’/+/n. If run tests are used to supplement 7; 
without a compensating setting of zone limits, an increased frequency 
of false indications of process changes results. 

The standard control chart test 7 (or T1(Z1)) is slightly more effec- 
tive than alternative zone tests in detecting relatively large changes; 
in addition, it has the important virtue of simplicity — a virtue that 
extends the range of economic application of 7; into areas where alterna- 
tive tests have better statistical properties. It is difficult to recommend 
a single alternative test to 7; for general application, though it is clear 
that alternative tests may be profitably used in many applications 
where early detection of relatively small changes is important. 
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A Criterion to Limit Inspection Effort 
in Continuous Sampling Plans 


By R. B. MURPHY 
(Manuscript received September 12, 1957) 


In continuous sampling plans of the type known as CSP-1, the amount 
of screening has an important bearing on the total inspection effort. To lamat 
this effort an inspector may be required to take special action of the number 
of inspected units in one screening sequence exceeds some specified value 
or “critical length’’. The aim of the special action is to bring about improve- 
ment in the production process. This effect is possible also when the produc- 
ing shop is required to do any screening called for by the inspection plan. 

A procedure for calculating critical lengths may be based on simple ap- 
proximations derived from the theory of runs. 


I. INTRODUCTION 


1.1 Continuous Sampling Plans 


The CSP-1 continuous sampling plans introduced by H. I. Dodge! 
are designed for continuous or “‘belt’’? production of discrete units of 
product. To apply such a plan, inspected units must be classified as 
either “‘defective’’ or ‘‘nondefective’’. The inspector begins by inspecting 
each unit made in succession until a specified number, 7, of consecutive 
units are found nondefective. A sequence of units so inspected is called 
a screening sequence and the number 7 the clearing number. After the 
initial screening sequence has ended, the inspector samples a fraction f 
of the units presented to him. He continues to sample until he finds a 
defective unit. At this point he again resorts to screening, following the 
same procedure as before, so that he alternates between screening and 
sampling inspection. The inspector rejects (or sets aside for correction) 
any inspected unit found to be defective and accepts all others. 

Two refinements of this plan, CSP-2 and CSP-3, have appeared” 
as well as generalizations of CSP-1°'*” entailing two or more levels of 
sampling inspection. In addition, various sequential continuous inspec- 
tion plans have been proposed.° 
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The characteristics of these different sampling plans — such as AOQL, 
fraction inspected, or characteristic curves — have been explored under 
a variety of assumptions. Of these assumptions, the statistical behavior 
of the production process has the greatest effect on the results. There are 
three alternatives which have been used (but which may not cover all 
plausible situations): 

(1) The production process is Bernoullian: each unit has the same 
probability of being defective independent of any other unit; the pro- 
portion of defective units converges almost certainly to this value as 
the number of units produced increases. It is therefore known also as 
the process average. 

(II) The production process represents a stationary Markov chain; 
each unit has a probability of being defective which depends only on the 
defectiveness or non-defectiveness of the previous k(= 1) units produced 
and is otherwise independent of time. 

(III) The production process represents a discrete stochastic process 
of an arbitrary nature. 

Not all the continuous sampling plans introduced have been examined 
under each assumption. 

Assumption (I) leads to the simplest dinthennties and will be adopted 
here. Its use does not imply that the CSP-1 plans — with or without the 
criterion proposed below — are invalid if the production process goes 
out of control. These plans were designed with this condition in mind. 
The effect of lack of control is to alter the stated characteristics of such 
plans, but the author has no evidence from actual production processes 
that such deviations are wide. ; 

Another factor that influences the characteristics of continuous sam- 
pling plans is the kind of sampling used when sampling is required. 
Again there are three alternatives commonly used: 

(i) The sampling is Bernoullian; each unit bears a probability f of 
being sampled independent of any other unit; in this case and in (iii) 
below screening is usually required to begin with the next unit after a 
known defective. 

(ii) One unit in each (disjoint) set of 1/f consecutive units produced is 
randomly chosen from the set for inspection. Screening, when required, 
may begin within the same set in which a defective unit is found, or it 
may begin with the first unit of the next set. One or the other method of 
starting to screen is usually specified. 

(iii) Every 1/fth unit is inspected. 

In most characteristics of CSP-1 it makes little difference whether (i), 
(ii), or (iil) is used provided (I) is assumed. Again the mathematics is 
simpler with (1), and accordingly we shall follow it. 
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A third assumption is sometimes made about the operation of CSP-1: 
each defective unit inspected is replaced by a nondefective one. This 
assumption affects only the character of outgoing quality. It will have 
no bearing on the criterion for inspection effort discussed below. 


1.2 Inspector’s Risk 


With this brief background we may take up the main subject of this 
paper. In using any inspection plan there are three areas of risk: one 
area pertains to the consumer’s operations, another to the producer’s 
operations, and the third to the inspector’s operations. One risk in the 
third area is that the inspector may be called upon to perform an exces- 
sive amount of inspection for the amount of protection he furnishes. 
The CSP-1 plans, although admitting the necessity of high inspection 
rates on occasion, are not really intended to be used when inspection 
will continue indefinitely at a high rate. In general such a high rate 
would not lead to economical and effective inspection nor to economical 
manufacture: screening alone does not guarantee that the level of in- 
coming quality will improve enough to diminish the amount of screen- 
ing significantly in the future. Neither is there so much confidence in 
the outgoing quality, which poor incoming quality may affect adversely 
in spite of intensified screening. Indeed, the existence of such a situation 
may imply some basic difficulty in the process of design or manufacture 
that cannot be properly handled by inspection methods alone. Not only 
the inspector but the customer may be undergoing a special risk. Fur- 
thermore, the producer often does any screening required (as Dodge 
originally recommended’). He too might find an appropriate special 
action economically preferable to a great deal of screening. 

Thus the inspector needs a special alarm signal to indicate that unless 
he takes special action a high rate of screening may continue. The fol- 
lowing sections show how such a special alarm signal for CSP-1 plans 
may be devised on the basis of the number of units inspected in any one 
screening sequence. If this number exceeds a “critical length’, n%*, 
chosen in advance, the inspector is to take an appointed special action. 

A similar type of criterion could be evolved for other types of con- 
tinuous sampling plans. The effectiveness of this type of criterion alone 
might be lessened if it were applied to other types of plans in which 
screening is not so promptly reinstated after a defective is found as in 
the CSP-1 plans. It seems certain that the ‘‘most sensitive criterion” 
for any of these plans, including CSP-1, would take account somehow 
of the observed per cent defective. On the other hand simplicity and 
convenience would have to be sacrificed to some extent to do so. For the 
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CSP-1 plans it is hoped that the proposed criterion of critical length is 
a satisfactory compromise between theoretical and practical require- 
ments. 

The special action to be taken when required by this criterion should 
depend upon the situation. It might be to notify the customer’s pur- 
chasing or contracting department; it might go so far as to cause the 
inspector to stop inspection, effectively halting purchase of product. If 
such a severe action is specified, the manufacturing unit may rightly 
feel entitled to be informed in advance whenever such action appears 
imminent so that it may begin to adjust the process and to screen prod- 
uct ahead of the inspector. Using a different criterion from the one 
proposed here, an existing government inspection plan’ does, in fact, 
require the inspector to stop inspecting. It is not our purpose, however, 
to discuss in detail any particular special action since its wisdom could 
be confirmed only by reference to the nature of the application. It is 
intended only to point out that such actions have already been devised 
and used. 

There is no reason to adjust published AOQL figures for CSP-1 plans 
because of the addition of this special action criterion to their operation. 
If the special actions are suitable, there is no reason to expect anything 
but an improvement in the outgoing quality level. 


II. THE CRITICAL LENGTH OF A SCREENING SEQUENCE 


2.1 The Basis for Choosing Critical Lengths 


It is generally possible for an inspection agency to state what it con- 
siders a reasonable upper limit to the amount of inspection it should be 
required to perform under a given CSP-1 plan. Let us call this limit F’*. 
Under our assumptions Dodge’ has shown that, when the probability of 
a defective unit is p, the average amount of inspection (i.e., limiting 
fraction of units inspected) is 


f 
F= a (q=1-p), (1) 
Ye oe a 
if an inspector uses a CSP-1 plan with clearing number 7 and sampling 
frequency f. It is clear that placing an upper limit F* on F is equivalent 
to placing an upper limit p* on p. Indeed, if 





(1 — pt)’ = ——. = k, (1') 
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the inequality / < F* implies and is implied by p S p* according to 
(1).T 

Having specified #* as the upper limit to the amount of inspection, 
we need a measure of the price the inspection agency should be willing to 
pay to enforce it. J°or our purposes it will be convenient to choose as a 
measure the maximum probability a* of taking special action when 
F s F*. It is equivalent to say that a* shall be the fraction of all screening 
sequences in which the inspector takes special action when F = F* or 
when p = p*. In practice the choice of F* or a* or both may be some- 
what arbitrary. In the author’s experience, the choice of F* = 0.5 and 
a* = 0.10 has proved reasonable. 

We may now choose a critical length n* so that special action is’taken 
in accordance with the risk specified above. First, the inspector is to 
take special action whenever a screening sequence has not terminated 
after the n*th consecutive unit in the sequence has been inspected. Sec- 
ond, n* is to be chosen so that when F = F* the fraction a* of all screen- 
ing sequences have not terminated after n* units. 

This second condition cannot in general be fulfilled exactly. Instead, 
if we call the probability that a screening sequence has not terminated 
after n units 7’,(p, 7), we shall find n* satisfying 


Tne(p*, 1) So <Twalp*,i). (2) 


It can be easily demonstrated that for any a* and p* satisfying 
0<a* <land0 < p* < 1, there is a solution, n*, to (2). 

It is sensible to desire that the higher the “true” limiting fraction F 
of units inspected, the more likely it is that a screening sequence will 
exceed its critical length. The truth of this statement can be easily 
shown also. This guarantees that a* is a maximum probability of taking 
the special action incorrectly. 

The mathematical problem, as we have stated it, is covered by the 
theory of runs. Its solution has long been known’ and will be discussed 
in the following section, as well as in the Appendixes. Briefly, in terms 


} Certain variations of CSP-1 lead to different expressions from those given 
here. For instance, if the producer does all screening, the inspector will often in- 
spect a fixed proportion, f, of all units — including those already screened. It is 
then more sensible to apply F* as an upper limit only to the average amount of 
screening, which is the product of 1 — g‘ and the right side of (1). Solving for p* 
or K then requires that f be added to the denominator (1 — f)F* in (1’). Another 
variation arises when defective units inspected are repaired and reinspected. If it 
were assumed that the proportion defective among repaired units is again p, it 
would be necessary only to divide the right side of (1) by 1 — p and to replace 
F* by F*(1 — p*) in (1’). Then p* or K would be found by iteration. The effects 
of these two variations may be combined but not without further assumptions. 
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of this theory, we may restate our problem as follows: Given a Bernoul- 
lian process with “success” probability g* = 1 — p*, to find the least 
number of trials, n*, in which the probability of having had no runs of 
~ (or more) ‘‘successes”’ is less than or equal to a*. 

Common sense demands that the special action never be taken until 
there has been some chance to complete the screening sequence. That 
is, the critical length of a screening sequence must be larger than the 
clearing number. It is shown in Appendix A that it is equivalent to re- 
quire 

* ier hi 
an (1 — f)F*’ (3) 
This restriction is usually minor. For instance, if, as above, F* = 0.5 
and a* = 0.10, then according to (8) f < %9 = 0.474". For a* < 0.5 
it is more convenient generally to use the inequality 


F* > f + 0.700%. (3) 


As is shown in Appendix A, (8) is satisfied whenever (3’) is. 

In any case the value computed for n* will depend upon the assump- 
tions discussed in Section I. If these are inexact, the probability state- 
ments outlined above will generally be inexact also. Nevertheless, the 
same value of n* may still be used with good prospect of limiting screen- 
ing effort without added penalty to the manufacturer. 


2.2 Computation of n* 


As noted above, the exact relation between n*, 7, p* and a* is known, 
but it is difficult to use. To simplify computation it has been found ad- 
visable to resort to an approximation for n*, which assumes the form 


n* = ait a, (4) 


where a and a; depend only on K, defined in (1’), and e*. A derivation of 
this approximation is given in Appendix B. It is based on asymptotic 
results for large n* and 7. 

It is interesting to note that K, when defined in terms of p* and 2, 
is the probability of terminating a screening sequence in exactly 7 trials. 
For the purposes of this paper we shall determine K in terms of f and F*. 

For convenience the coefficients a) and a; are presented in graphical 
form in Fig. 1 with values of K on the abscissa and with separate curves 
for a* = 0.01, 0.05, 0.10. The requirement (8) is observed by plotting 
these curves only over the interval of values of K _ satisfying 
a®* <1 — K < 1. While the immediate field of interest is inspection, the 
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Fig. 1 — The coefficients a9 and a; as functions of K = f(1 — F*)/F*(1 — f) for 
a* = 0.01, 0.05, 0.10. The critical length n* is approximated by aii + ao. 


values of a and a; read from this chart obtain equally well in other uses 
of the theory of runs. Therefore, the range of K in Fig. 1 is consider- 
ably larger than would be necessary to handle this particular problem 
alone. Given the values f, 7, F* and a*, the value K may be computed 
from (1’). If a* = 0.01, 0.05, or 0.10, we may choose the proper curve 
for a) , read off its ordinate at the computed value of K, follow a similar 
procedure to find a; , and compute n* from (4). 
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In order to compute the coefficients ay and a, for any values of a* and 
F* satisfying (8), it is necessary to compute 


AES 
w= Sine Soy a eh f 


{| — ies: (5) 





and to solve 
we” = ve’, (v ~ wforw # 1), (6) 


for v, the letters “In” indicating the logarithm to the base e. It is usually 
easiest to solve (6) by the following convergent iterative procedure: 
If w = 1, put | 


Uo = we, Um41 = Ue”; (7) 


IA 


if w S 1, put 


vw = w—Inuw, Um41 = % + Invy. (7’) 


In either case v may be obtained with as much accuracy as desired by 
simple iteration with formulas (7) or (7’). 
The coefficients a) and a; then may be expressed as 


1 w—v wa* 
a= 1 fmt me, | (8) 
w—v v+w-—2 
7) ape (9) 


The limiting values of a; and aj as w and v approach unity are given by 

(B9), (B11), and (B12) in Appendix B. 

The accuracy of the approximation (4) has been investigated and found 
to be adequate. For small 7 and f and large F* slightly greater precision 
is possible with the Uspensky approximation’, computation of which is 
simplified by Feller’s iterative procedure” (see Appendix B). Both ap- 
proximations lose accuracy as a* increases. . 

‘ For F* = 0.5 and a* = 0.1, Table I presents a comparison of the exact 
integral value satisfying (2) with the two approximations, in which the 
value of n* satisfies the equation concerned as precisely as possible and 
is therefore not integral. For this table the exact recursion formulas (A8) 
and (C3) were used. The latter was found by Miss M. N. Torrey and 
the writer and appears to be new. 


ap = 


2.3 Some Properties of the Criterion of Critical Length 


According to the previous discussion, it is proposed to take special 
action whenever a screening sequence exceeds its critical length. Since 
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Tasie I — Tasie or Crirican Numprers n* For Uprrr Limit oF 
Fraction Inspectep f* = 0.5 anp Maximum PROBABILITY 
oF Error a®* = 0.1F 


























Sampling Frequency, f 
Clearing 
Number, 7 
0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 
5 88 47 32 24 | 19 15 12 10 8 
87.8 46.6 | 31.5) 23.4] 18.2) 14.5 11.7 9.4) 7.4 
85.7 45.9 31.2) 23.3 
10 153 84 58 44 | 85 | 28 23 19 16 
152.2 83.3 57.5| 48.5) 34.4; 27.8 | 22.8 18.6) 14.8 
151.2 83.0—| 57.4 
20 283 158 110 84 | 67 | 55 45 388 | 31 
282.5 | 157.3 109.8} 83.7] 66.7] 54.4 | 44.8 | 36.9] 29.8 
282.0—| 157.1 
50 675 380 267 | 205 /164 /135 111 94 | 75 
oe 379.6 | 266.8) 204.5/163.6/134.1 |111.0+] 91.9] 74.6 
674. 
100 1329 751 529 | 406 (826 /268 221 187 {150 
1329 750.3 | 528.6} 405.8/325.2/266.8 |221.4 |183.5)149.3 
1329 750.2 267 .0— 
300 3946 2233 1576 {1212 |973 (800 661 560 (450 
3947 2234 1577 |1212 |970.9,797.0—|662.7 |550.1/448.0-- 
3946 2233 1576 |1212 971.9 798.4 |662.9 














} The triad of numbers appearing for f and 7 combinations are, reading down, 
the exact value, the Uspensky approximation, and the approximation (4) to (9). 
The last is omitted if it agrees with the second to 0.1. Approximate values less than 
1,000 should be rounded to the next higher integral value to obtain the result cor- 
responding to the exact value of n*. This method was followed in rounding approxi- 
mate values greater than 1,000. 


the aim of such action would be to bring about improvement in the proc- 
ess, it might be justifiable to resume inspection with sampling after the 
special action has been taken. There is a question in any case whether 
the screening sequence, once interrupted by special action, should be 
resumed at the point it was stopped. A cautious procedure would be to 
resume inspection with a new screening sequence not involving any pre- 
viously inspected units. This course would lead to a lower AOQL but a 
higher fraction inspected, f°, than the original plan.{ Resumption with 
sampling would have the opposite effect, but the changes in either case 
should be slight in practice. 

We shall consider in detail only the effect of increasing the total amount 
of inspection when inspection is resumed with a new screening sequence 
after special action has been taken. With this alteration in the CSP-1 
inspection plan, the limiting fraction inspected, F°, according to our 

{There is no change from the original values if the inspector takes special 


action as soon as he finds a defective unit after n* — 7 units in a single screening 
sequence and before that sequence is ended. 
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mathematical model, is given by (D24) in Appendix D. Furthermore, 
the upper limit to this fraction is according to (D34) 


[r* 


Ws cae a ee 
ee ae 


(10) 


where 

B* = (1 — a®)/(L — a®K — Treii(p*, 1) (11) 
and all other quantities are as defined previously. The comparatively 
minor change from F’* to F™ in the range of interest can be illustrated 
by noting that if F* = 0.5, a* = 0.1, and f 2 0.05, we will have 
0.5 < F* < 051. 

Under these same assumptions two other characteristics of the modi- 
fied CSP-1 plan can be readily computed: The average number of special 
actions per 10,000 units produced and the average number of special 
actions per 10,000 units inspected. These may be computed by multi- 
plying C in (D27) and C’ in (D32) respectively by 10’. The first of these 
two averages may be the more useful to the practitioner, who can use 
the value of this average at p = p* as an added measure of the price 
paid for using the criterion of critical length. In some cases he may prefer 
it to a*, For F* = 0.5, a* = 0.1, and p = p* this number varies from 
about 0.4 for f = 0.05 and i = 5 to about 15 for f = 0.45 andz = 100. 

Another more theoretical use may be made of these two averages. 
We may wish to compare the operation of the criterion of critical length 
with that of any other criterion adopted for the same purpose. The 
parameters of the criterion to be.compared to the present one could be 
adjusted so that one or both of these two averages agree for the two 
schemes when p = p*. Then the average number of special actions per . 
10,000 units produced could be plotted against p or F in both cases. On 
the other hand one may wish the fraction inspected to be the same at 
p = p* for the two schemes. However, criteria calling for special action 
at certain times when the last inspected unit was defective lead to the 
same fraction inspected as found in the original CSP-1 plan. In such 
cases it is not possible to obtain equal fractions inspected, since ’'° > F. 
It appears better in general to deal with the two averages C and C’ 
for the purpose of comparing criteria. At any event, as has been men- 
tioned above, such formal comparisons are not complete measures of 
practical value in themselves. 
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APPENDIX A 


Some Properties of the Run Probability T,(p, 7) 


As before let T,(p, 7) be the probability that a screening sequence 
with clearing number 7 and process average p has not terminated after 
the nth consecutive unit has been inspected. This is the same as the 
probability of no run of 7 or more ‘‘successes” each having probability 
q = 1 — pin n independent trials. Except when necessary for clarity 
to do otherwise, we shall abbreviate 7',(p, 7) by 7’, . 

It is easy to see that 


T,=1, n=0,1,:::,i—-1, (Al) 
T:=1-—q,q@=1- p), (A2) 
Tria -Tn=peTreina, (n>). (A3) 


From these relations it appears that the generating function of 7, , 
Ta) = doe Te, 


satisfies 





ers g'z' 

T = _ A4 
in which both numerator and denominator have the common factor 
Li ge, 

If it is required that 7’; > a*, we have directly from (A2) 


1 — gt > aX. (AB) 


From (1’) and (A5) the inequality (8) follows. In turn F* — f is seen 
from (3) to exceed a*f(1 — f)/[1 — a*(1 — f)]. Maximizing this quan- 
tity with respect to f, we have 2(1 —~/1 — a*)/a* — 1, which is less 
than a* for 0 < a* < 1. This maximum and its derivative with respect 
to a* are increasing in this interval, and the former assumes the value 
3 — 4/2? < 0.35 at a* = 0.5. The inequality (3’) follows immediately. 
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APPENDIX B 


Derivation of an Approximation for Critical Length 


From the expansion of (A4) in partial fractions, Uspensky has shown 
that as n approaches infinity, 


ee 1 — gé 1 
a ee ve 
with the unique positive root of 


i—1 


YY @ ss. (B2) 


s=0 pe 


The Uspensky approximation for 7’, leads to the approximation (4) 
for n* satisfying (2) for any given a* (0 < a* < 1), 7, and p*(0 < p* < 1). 
It can, in fact, be shown that 


n*~woitataw +a? t+ ::: 


If in (B1) and (B2), we put p = p*, gq = g*, and n = n’*, it follows 
from (1’) and (B2) that 


1— Ké 


Tnx ~ GErep : (B3) 
Likewise, making the same substitutions in (B2) it follows that 
Ka — K™)c* ~2 +1=0 (B4) 


has two and only two positive real roots, 71 = 1/g* and x, = é. 

We shall consider a system of equations in five variables equivalent 
to the system (B3) and (B4) in the five variables K, 7, &, n*, and Ty. 
We shall call the new variables w, z, v, ¢, and a*. The new system of 
equations is 


1 ed = wey 


Qa 
l1—v — vz 


(1 = va)’, (B5) 


and 
ev" —e”)(1 — vz)” = vz, (B6) 


where only finite positive values for all variables are going to be con- 
sidered with ve < 1 and 0 < a® < 1. If in (Bd) and (B6) we put 
w= —-n K,2=7 ,v=i§ —1)/t, 9 =7 (n* +1), and a* = Tn, the 
result is (B3) and (B4) with the symbol ‘“~” replaced by “=” in (B3) 
and with « = £in (B4). © 
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Like (B4), (B6) has one “extraneous” root, 1:(w, z) = (1 — e ””)/z. 
Teither positive root, v:(w, 2) or v2(w, z), is such that there exists a finite 
function v.o(w) = lim v,(w, z), (s = 1, 2). Indeed, ve(w) = w and 


Vo9(w) = vo(w) ee the limiting form of (B6) given by (6). Clearly either 
0<wZi1 Sv (w) < »~ or0 < »(w) S$ 1 Sw < o, the equality 
signs holding simultaneously. 

Taking logarithms of both sides of (B5), we have 


e= (net 4- in| 5 er ese a ee are = aE al) /? In(i — v2). 


As z approaches zero, ¢ approaches 


vow, a*) = aa a (Im @* + In je = mw) TN (B7) 


w — vow) 


We may differentiate (B6) and (B5) to obtain respectively 











dv(w,z)| —__ vo(w) (vo'(w) — w) 
02 2=0 2 (1 — v(w)) 
and . 
* dg 
Yo (w, a*) = 
02 z=0 
(B8) 
= 0 = Ml) ap, att) — Pole) tw = 2 
21 ww) 21 — w(w))? ” 
Substituting the values w = —In K,z = 7, andg = 7 (n* 4+ 1) in 
e = gow, a*) + zg'(w, a*) + --- and putting 
vo(w) = Gs a = gow, a"), a = go (w, a) — 1; (B9) 


we have the approximation given by equations (4) through (9). 
If both sides of equation (6) are divided by d = w(w) — w, we get 


w = d/(e* —1). (B10) 

From (B7), (B8), and (B10) we find 
go(w, a*) = In2 — In a* + 0,(d), (B11) 
go (w, a*) = go(w, a*) — 3 + 02(d). (B12) 


_ The related approximation for 7, , as n and 7 approach infinity, is of 
the form 


Tn er Agnes (B13) 
where A, B, C, and D depend only on q’. 
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APPENDIX C 


A Recursion Formula for T,(p, 1) 


In order to investigate the error in n* computed from the Uspensky 
approximation (B1) or the approximation (4), a convenient form of 
the exact value of 7,(p, 7) was needed. Such an expression is 


min (k,r) 


‘ie a 2d, (=1)'@) (pq°)° T's)is (Cl) 


where n = ki + randk = 1and0 Sr S 1. This may be established 
~ easily by an inductive argument. Indeed, if k = 1 and r = 0, (Cl) 
yields an identity. By adding successive expressions of the form of 
(A3), we obtain 


fines = Ti - py’ > St rer nee (C2) 
For k = 1, (C2) yields with the aid of (A1) 
T isn = Ti — rpg’, 


substantiating (C1) for k = 1 and 1 S r S 7. Next we assume (C1) 
to be true for some k = land some r, (0 Sr S 7). We wish to show that 
(C1) is true for n = (k + 1)d + r. From (C2) and the induction as- 
sumption 


r—1 min(k,s) 


Taster = Topi — pg 2 d (—1)'(:)(pq') Tas): - 


If the order of summation is reversed, the double sum becomes 


: min(k,r—1) ; r—1 
— pq pe (—1)'(pq')' Tuoi 2 (:) 
t= s= 
min(k+1,r)—I1 


= » (—1)°"'(pq') Pa i(e41), 


t=0 


so that 


min(k+1,r) . 
T epayigr = 2d, t=1)"G) (pq')’T (41s) 
The special form of (C1) used in checking accuracy was that with 
redgand| Sk =12: 


Topni = > (—1)°() (pq')°T es) « (C3) 
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APPENDIX D 


Some Characteristics of the CSP-1 Plan With and Without the Criterion 
of Critical Length 


The use of generating functions is helpful in characterizing the original 
CSP-1 plans. The theory of Markov chains, applied somewhat as in 
Reference 4, also leads to some of the results found here. While this 
theory is convenient to show the validity of the strong law of large 
numbers as applied to fraction inspected and other ratios, the task of 
computing (to which we restrict ourselves here) appears generally 
simpler with the generating function technique. Let P, be the probability 
that the rth unit produced is the first one in some sampling period, and 


let Q, be the probability of being in a sampling period on that unit. Then 
P,=0,08r8), Pu=q, 
7 | (D1) 
P,=qopil-(1—-f)Q-—al, (>it), 


and 
Or =) ao Pl — fp), (r = 0). (D2) 


If P(x) and Q(x) are the corresponding generating functions, we have 
from (D1) and (D2) respectively 





a tg’ 
P@) = 2-L ft -—q@— pd —f-He@) — (D8) 
and 
Q(z) = P(x)/[l — C — fp)z, (D4) 
whence 
tit i-1 at 
o@) = |1-va-peDa@|. os 
With some manipulation of partial fractions we obtain 
a tH qd — x)" ~ r 
Q(x) = qa | a a —fe I > ea |, (D6) 


where it can be shown that e, approaches zero as r approaches infinity. 
It follows that 


Q = lim Q,; 


= qg 
eS (rapier 2D 
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and, therefore, the limit of P, exists as does the limit of 
F,=1—-(1 —f)d-, (D8) 


the probability that the rth unit produced will be inspected. From (D7) 
and (D8) we may obtain (1). 

There are similar results in terms of units inspected. Let P,’ and Q,” 
correspond to P, and Q, with r in the former pair indicating the ordinal 
number of the unit inspected. Thus 

Py = O10 S7-s 7%), Pi’ = q, 


(D9) 
P, = ¢p, (r>i+ 2D, 


and 


Q; =0,0srs2), 


(D10) 
Q," = Ds Pod _ p)'* = q ete 1). 


Before passing to the modified CSP-1 plan, we observe that we may 
write the expression for P, in (D1) and (D8) in a different way. First, 
let R, be the probability that screening is stopped for the first time after 
the rth unit produced (i.e., a run of 7 nondefectives has been completed 
for the first time with that unit). The generating function of R,, R(x), 
then satisfies 


R@) =1- 0-2) 7@ = Re, ow 


where 7'(x) is given by (A4). Next we may put 


P, = Rea t+ fp(Qoka + +++ + Qralto), (D12) 
or 
P(x) = cR(x)(1 + fpQ@)), (D13) 
and 
Q(x) = xR(x)/[1 — « + fox — R(z))). (D14) 


We now take up the case in which a criterion for the length of screen- 
ing sequences is applied in the operation of the CSP-1 plan. ‘To simplify 
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notation a little, we will hereafter call the (fixed) critical length n rather 
than n*, Also the probabilities corresponding to P, and Q, will be de- 
noted by P,° and Q,°. We shall treat the occurrence of an incomplete 
screening sequence of length n as a recurrent event; as soon as such a 
sequence occurs, the whole inspection procedure begins anew with the 
next unit produced. 

Now let R,° be the probability that screening is stopped for the first 
time after the rth unit produced. Then 


Rey SS OR aera: (D15) 
and the generating function is 
RY(a) = R¥(a)/(L — Tre”), (D16) 
where 
PG) => rake. (D17) 


If the superscript C is attached to the symbols P, Q, and #& in (D2) 
and (D12), we arrive at valid equations. Therefore, we may write down 
the generating function Q(x) by placing the superscript C on the same 
letters in (D14). Using the identity 


R*(z) = 1 —-— Tax” — 1 — 2)T* (a), (D18) 


where 
T*(x) = Dore Tt", | (D19) 
we have 
O°) = EO fy — ra" + por). (D20) 


Again we find by the use of partial fractions 


Cc = * qd Ao a) ~~ tr 
Q(x) = aR (x) [Sa + > Cr X | (D21) 


where it can be shown as before the e,’ approaches zero as r approaches 
infinity. Hence, 
1 — T,, 


Cy: Ores 
Q ae Qs 1 Ta + foT*)’ (D22) 
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and the limit of the probability of inspecting the rth unit produced, 
Ee aq, (D23) 


ro = tmp’ — fh = Pe) + fpT*0) 


me 1— f, + fpT*() ~ Oe) 


We may likewise compute the probability C, that the criterion of 
critical length will be applied on the rth unit produced: 
C; — D, + (pOrsea Cen” ss eae = Ona Te = Gu) (D25) 


where D, = 7," if r = nk and is zero otherwise. The generating function 
is 





C@) = "Fe a - (1 + fpQ°(2)), (D26) 
— nv 
so that 
elas as foTn Cc foT n . 
COT a tiene pea, 


As with the original CSP-1 plan we may find the probabilities P,”° 
and Q,'° corresponding to P,° and Q,° in terms of inspected rather than 
produced units. We may also obtain C,’, the probability of applying 
the criterion on the rth unit inspected. The generating functions are 
easily seen to satisfy 


PI(a) = aR) + pO), (D28) 
Qe) = PI(a)/(1 — a2), (D29) 
and 
Oe) = 0 + 00"). (D30) 
Ta 


We find, using the previous methods, 


1 4: Ie. ae 
ee ee ar 4 oP)’ a 
C= lim G/ = pes (D32) 


r->0 Lae pe) 
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It is possible to obtain the limiting probabilities (D27) and (D382) 
indirectly as the reciprocals of expectations of recurrence times, although 
the method of generating functions allows a more complete characteriza- 
tion. If m is the number of units produced until the criterion of critical 
length is applied and m; is the number of units inspected until the same 
event, it is interesting to note from (D24), (D27), and (D32) that 


F° = C/C’ = E(m)/E(m), (D33) 


where E is the expectation operator. 
The values of these same limiting probabilities are of particular in- 
terest when p = p*. Then 7, = a* and 


pre ye SIs Sl aes 


assuming an exact solution to (2). From (D24), for instance, we have 
for p = p* 


a 
es PY = FEO Re oe 


and 
Cc’ = C™* = a*g*p*K/(1 — a*), (D35) 


where @* is given by (11). The denominator of 8* can be fairly well 
approximated by 


1—oa(K+e°), 


where v is defined by (6). Finally (D33) may be used to find C*. 
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Nonparametric Definition of the Repre- 
sentativeness of a Sample—with Tables 


By MILTON SOBEL and MARILYN J. HUYETT 
(Manuscript received October 4, 1957) 


The problem is to determine how large a random sample ts needed in 
order to attain a preassigned probability P*(4 S< P* < 1) that the sample 
will possess a certain amount (or degree) of representativeness of the true 
unknown (cumulative) distribution F under study. The definition of repre- 
sentativeness involves two preassigned constants k and B*(k 2 2 is an 
integer). For example, for k = 2 and any B*(O < 6* S 4) the sample 
as defined to be representative if the proportion of the total sample size fall- 
ing on each side of the population median differs from 4 by at most *. 
In this case the degree of representativeness 1s defined as d,* = 1 — 26*. 

This idea can be extended to any number k of disjoint, exhaustive cells 
equi-probable under F; tables and graphs are given for finite and infinite 
populations for selected values of k, B* and P*. The definition is also 
extended to cases in which the experimenter is particularly interested in 
parts of F which are not equi-probable and/or parts of F which do not ex- 
haust the whole sample space; tables and graphs accompany each applica- 
tion. 

These results are non-parametric, 1.e., if the prescribed sample size is 
used then the experimenter’s requirements for representativeness will be 
satisfied whatever the unknown distribution. Derivations of exact and ap- 
proximate formulae used in computing tables are given in the Appendices. 


I. INTRODUCTION 


This paper deals with the problem of determining how large a random 
sample is needed in order to guarantee with preassigned probability P* 
that the sample will have a specified amount (or a specified degree) of 
representativeness of the true, unknown (cumulative) distribution F 
under study. No a priori information is given about F and no assumptions 
are made about the form of F. The solution given is nonparametric (1.e., 
distribution-free) so that the results obtained and the tables and graphs 
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constructed are valid for any true underlying distribution. The case of a 
finite population as well as that of an infinite population is considered; 
in the latter case it is assumed only for ease of exposition that those 
percentiles of / which enter the discussion are uniquely defined and have 
probability zero under IF’. (This will, in particular, be the case when 
has a density function without zero-stretches between points having 
positive density.) 

A definition of representativeness (and also a degree of representative- 
ness) is given with respect to those parts of Ff which are between certain 
percentiles which we denote by F'(p,), the values of p; being pre- 
assigned. The intervals between these percentiles will be called cells and 
we shall only consider collections of pairwise disjoint cells. For example 
the experimenter may want to guarantee with probability at least 
P* = 0.90 that between 40 per cent and 60 per cent of his sample will 
lie on each side of the population median. In this case we are interested 
in the part of F (or the cell) between F~'(0) and F~‘(0.5) and also the 
part of F (or the cell) between F~’(0.5) and J” “(1). By the definitions 
below the common allowance {* is 0.10 and the degree of representative- 
ness d,* is 0.80 (or 80 per cent). Then we enter Table I (or IT) with 
k = 2, P* = 0.90 and 6* = 0.10 and find that the smallest sample size 
needed to satisfy the experimenter’s requirement for representativeness 
is n = 60. (It is instructive to note that the same solution would 
hold for any two disjoint, exhaustive subsets of the sample space having 
a common probability of 4 under F. However, the cases in which we 
consider disjoint cells and, in particular, disjoint cells which start 
from one end or both ends of the distribution are of considerably more 
practical interest. The cell terminology will be used in the body of 
the paper while the subset terminology will be used in the appen- 
dices.) 

In the above example the sample space is broken up into two dis- 
joint, exhaustive cells which are equi-probable under I’. This idea of rep- 
resentativeness can be extended to any number k of pairwise disjoint, ex- 
haustive cells equi-probable under F and in the numerical work the values 
k = 2,3, 4, 5 and 10 are considered. The idea of representativeness can 
also be used with cells that are not equi-probable and/or with cells that 
do not exhaust the whole sample space. As an example of the first type 
(cells not equi-probable) we might be concerned about whether a sample 
is large enough to be simultaneously representative of a single tail with 
preassigned probability p < 4 under F and of its complement which has 
probability (1 — p) > 4 under Ff. As an example of the second type 
(non-exhaustive cells) we might be concerned about whether a sample 
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is large enough to be representative o both tails (each having (say) a 
common preassigned probability p < 4 under F’), without any concern 
about the middle cell between the ie tails. For each problem tables 
and graphs throughout this paper give the smallest required sample 
size for selected values of P* and specified amounts (or specified degrees) 
of representativeness. 

Assuming for the moment that the density of F is known and that all 
of its deciles are finite then we can plot an observed bar diagram (ie., 
rectangles with different widths under the dashed lines in Fig. 1) and 
the true density on the same diagram as shown in Fig. 1 to illustrate the 
idea of a representative sample. By definition of a decile each of the ver- 
tical strips bounded above by the curve has an area (or probability under 
F) of 0.1. The observed sample is considered representative relative to 
this pattern of ten disjoint, exhaustive and equi-probable cells to within 
a common allowance 6* if simultaneously the areas of all vertical rectangles 
differ from the theoretical value of 0.1 by at most B*(0 < 6* S 0.1). 
Then the degree d,* of representativeness as defined in Section III is 
equal to 1 — 108*. We are interested in finding the smallest sample size 
needed to guarantee a probability of at least P* that the above condition 
will hold in a sample drawn at random from F. 

This problem is related to the well-known problem! of Koinieeorow: 
Smirnov since they both have the common purpose of determining 
the sample size required to obtain a representative sample. Since their 
definition of representativeness is different from the one treated here, 
it is difficult to make a proper comparison of the two procedures. Another 
remark on this comparison is made in Appendix IV. 


II. DEFINITION OF REPRESENTATIVENESS 


Let Ff denote the true unknown cumulative distribution and let F,,* de- 
note the observed sample distribution based on n observations. For any 
given k let C1, Co, +--+ , C, denote pairwise disjoint cells (not necessarily 
exhaustive or equi-probable under F) which are defined by certain per- 
centiles. The cells C) , C2, --- , C, are not known but their probabilities 
under F' are given positive numbers; let F(C’;) denote the probability 
assigned to C; by the distribution F(¢ = 1, 2, --- , k). (We are using F 
and F,,* as symbols for both point functions and probability measures 
which are set functions; clearly, the nature of the argument will prevent 
any confusion.) Let 6;* denote specified positive numbers (which we shall 
call allowances) such that 


0<6* S F(C) @@ = 1,2,---,k). () 
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We shall be particularly interested in the special case 6;* = 6.* = --- 
6.* = B* (say), whether or not the quantities /(C;) are all coat ‘Then 
a sample is defined to be representative relative to a fixed pattern of 
k disjoint cells Cy , C2, --+ , C; to within the allowances 6;*, B.* --- , Bx*, 
respectively, if we have simultaneously 


|F.*(C) —F(C)|S6* @=1,2,---,k). (2) 


III. DEFINITION OF DEGREE OF REPRESENTATIVENESS 


Although the quantities 6,*(¢ = 1, 2, --- , k) are basic to the idea of 
representativeness it may be useful, in a given problem, to combine them 
to define a measure of the degree of representativeness. We define 


mL - res ]} @ 


where the subscript g denotes the fact that d,* is a geometric mean. It 
follows from (1) that 0 S$ d,* < 1 and that d,* can take on all the values 
in this interval. 

It should be noted that for any fixed set of values of F(C;) 
(¢ = 1,2, --- , k) if there is a common 6* then the right hand member of 
(3) is a strictly decreasing function of 8* for B* < min F(C;). Hence, if 
there is a common 6* the values of d,* and 8* uniquely determine each 
other. When this is the case we may be interested sometimes in specify- 
ing d,* (instead of 6*) and then using (3) to solve for the common £*. 

We shall say that a random sample is representative relative to a fixed 
pattern of k disjoint cells Ci, C2, +++ , Cy to a degree d,* if for the com- 
mon B* = 6*(d,*) satisfying (8) we have 


|F.*(C) — F(C)|S6* @=1,2,---,h). (4) 


It should be emphasized that the chief interest of this paper is in the 
concept of representativeness as formulated in Section II and that the 
present definition of the degree of representativeness is to be regarded 
as supplementary. 

One possible criticism of the definition of d,* is that it may require a 
positive (and sometimes substantial) number of observations to attain a 
zero degree of representativeness (see, for example, the last and third 
from last columns in Table III). However, since the practical use of the 
concept of degree of representativeness is mainly for large values of d,* 
this objection is not serious. 
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It is possible also to define the degree of representativeness as an 
arithmetic mean d,* of the bracketed quantities in (8) but then for a 
common * and different F(C;), because of (1), the value of d,* is re- 
stricted to an interval J S d,* < 1 where J is positive and depends on 
the values of the F(C;) (@ = 1, 2, ---, k). Clearly, if the F(C;) are all 
equal and there is a common §* then d,* = d,*. 


IV. CONSTRUCTION OF TABLES 


The problem is to find the smallest sample size n such that the joint 
probability of all the inequalities (2) [or (4)] is at least equal to a specified 
value P* < 1, 1e., such that 


P{ | FX(C) — FC) |S 8" =1,2,-°,H)=2P% (5) 


The reader is cautioned that it does not necessarily follow that (5) 
holds for any integer greater than n; however, since F,,* converges almost 
certainly to F (see page 20 of Reference 2), it follows that there exists 
in each case a smallest number n’ = n such that (5) holds for every 
integer greater than or equal to n’. For example, with k = 2, a common 
g* = 0.20 and P* = 0.75 the condition (5) is satisfied for n = 3, for 
6 and for any integer greater than or equal to n’ = 9. 

Since the cells C; are pairwise disjoint and the values of /'(C;) are given 
(¢ = 1, 2, --- , &) the left member of (5) is determined for any particu- 
lar sample size whatever the unknown distribution F’, In the case of an 
infinite population we use the multinomial distribution with k or k + 1 
disjoint cells depending on whether or not the k disjoint cells are exhaus- 
tive, i.e., on whether or not >-i..F(C;) = 1. For the case of two dis- 
joint, exhaustive cells this clearly reduces to a problem of the binomial 
distribution which is closely related to the problem of finding confidence 
limits on a population percentile by the use of order statistics. Similarly 
in the case of a finite population we use the hypergeometric distribution 
with k or k + 1 categories depending on whether or not )-fF'(C;) = 1. 
The exact and approximate formulae for computing the left member of 
(5) are given in Appendices I and II, respectively. The approximate cal- 
culation involves several interesting geometrical digressions which are 
discussed in Appendix III. 

Table I gives for k = 2 and selected values of 8* and P* the required 
sample sizes n and n’ and also the maximum drop in probability below 
the specified P* for all sample sizes between n and n’. In the remaining 
tables only the values of n are given. Table II gives the required sample 
size fork = 2, F(C1) = p, F(C2) = 1 — p for p = 0.5, 0.2 and 0.1 (for 
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TABLE I 


Sample size required to attain a probability P* that a sample will be 
simultaneously representative to within a common allowance 6* of two 
disjoint and exhaustive cells separated by the median for any true dis- 
tribution. 

In each set the first entry is the smallest sample size required to satisfy 
(4); the second entry is the smallest size required such that for all 
sample sizes at least as large, (4) is satisfied; the last entry is the maxi- 
mum deviation in probability below P* obtained for all sample sizes 
between the first two entries. 






































p* B* 0.01 0.05 0.10 0.15 0.20 0.25 0.40 
0.50 1051 31 5 5 2 2 2 
1199 59 14 10 5 2 2 
(0.0264)| (0.1271)| (0.2266)} (0.1875)} (0.1250)} (0) (0) 
0.60 1700 60 & 5 3 3 3 
1850 79 «|. 24 10 8 3 3 
(0.0162)| (0.0704)} (0.3266)! (0.2875)} (0.2250)} (0) (0) 
0.70 2600 100 20 8 3 3 3 
2750 119 29 16 8 6 3 
(0.0124)| (0.0382)| (0.1049)| (0.2078)! (0.3250)| (0.0750) (0) 
0.75 3251 120 25 11 3 3 3 
: 3399 150 39 16 9 6 3 
(0.0077)} (0.0407)) (0.0769)! (0.1377)| (0.3750)| (0.1250)| 0) 
0.80 4051 151 35 14 9 4 4 
4199 179 44 24 12 7 4 
(0.0058)| (0.0328)| (0.0430)} (0.0518)! (0.0266)| (0.0750)| (0) 
0.85 5100 191 45 17 10 4 4 
5250 219 BA 27 15 10 4 
(0.0052)| (0.0269)! (0.0434)! (0.0879)| (0.0766)} (0.1250)| (0) 
0.90 6700 260 60 28 13 8 5 
6850 279 74 33 18 1 5 
(0.0029)! (0.0129)} (0.0299)| (0.0360)! (0.0796)! (0.0797); (0) 
0.95 9551 371 90 37 20 12 6 
9699 399 99 47 28 15 6 
(0.0012)| (0.0070)} (0.0114)! (0.0280)} (0.0284)! (0.0423) (0) 
0.99 16500 651 160 71 39 24 8 
16650 679 169 76 42 26 12 
(0.0003) | (0.0013) | (0.0022) } (0.0028) ) (0.0015) | (0.0046) | (0.0017) 




















For n S 150 the entries are all exact; for n > 150 the entries involve approxi- 
mations. The pattern of increases and decreases of the probability as a function 
of n was also used to obtain the first two entries for large n. 
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selected values of 8* and P*). Table III gives the required sample size 
for the case of k pairwise disjoint, exhaustive and equi-probable cells 
(Ci, Co, ---, Ce) for k = 2, 3, 4, 5 and 10 (for selected values of 6* 
and P*), Table IV gives the required sample size for k = 2, F(C,) = 
F(C,) = p for p = 0.2, 0.1 and 0.05 (here the cells are disjoint and equi- 
probable but not exhaustive). Table V considers the same problem as 
in Table III and compares the required sample sizes for infinite popula- 
tions, VN = o, with those for finite populations of size N for N = 60, 
120, 360. Tables VI and VII give illustrations of the error involved in’ 
using the approximations used in Tables IV and V, respectively, instead 
of an exact probability calculation. 


Tig. 2 shows for selected values of P* that the sample sizes in Table I 
and in the first portion of Table II can be “linearized” for large n on a log- 
log plot of n versus 6*. Figs. 3 and 4 show the same result for the last 
and middle portion of Table II, respectively. 


TaB.e II 


Minimum sample size required to attain a probability of at least P* that 
a sample will be simultaneously representative to within a common 
allowance 6* of two disjoint and exhaustive cells separated by the 100 
pth percentile for any true distribution. (The ie of yy 


ness is then defined as d,* = /a-® aa BY (1 - =) (4 rata steer =) 


50th Percentile (Median) 20th or 80th Percentile 10th or 90th Percentile 
(p = 0.50) (p = 0.20 or 0.80) (p = 0.10 or 0.90) 














» 0.01 0.05 | 0.10 | 0.15 | 0.20 0.01 0.05 | 0.10 | 0.15 | 0.20 0.01 0.05 | 0.10 





1,051} 31} 5 
1,700} 60| 5 


211 662] 12] 7 
37| 1,062} 32] 7 
2,600 | 100} 20 31/ 1,662) 52] 10 1¢} 900} 20] 1+ 
3,251 | 120] 25 31| 2,062 | 72] 10 14} 1,100 | 40! 1+ 


0.50 5 ly} «63855 | 14; If 
0.60 5 

0.70 8 

0.75 11 

0.80} 4,051 | 151 | 385] 14] 9] 2,562 | 92] 20] 12] If) 1,400; 40] If 
0.85 17 

0.90 28 

0.95 37 

0.99 


ly} 500; 14] If 


WOOoon 


5,100 | 191 | 45 10 | 3,262 | 120 | 27/12)/ 3t| 1,800; 60; 1} 

6,700 | 251 | 60 13 | 4,262 | 160 | 37 | 15) 5 | 2,355 | 80] If 

9,551 | 371 | 90 20 | 6,100 | 232 | 50 | 20 | 10 | 3,400 | 120 | 10 
16,500 | 651 | 160 | 71 | 39 | 10,562 | 420 | 100 | 40 | 20 | 5,900 | 220 | 15 





























For n S 150 the entries are all exact; for n > 150 the entries are based on ap- 
proximations together with a knowledge of the monotonicity pattern of the 
probability of representativeness as a function of n. 

+ Small entries for certain pairs (6*, P*) indicate a condition too weak for prac- 
tical usage. 


Tasie III 
Minimum sample size required to attain a probability of at least P* that 
a sample will be simultaneously representative to within a common 
allowance 6* of k equi-probable disjoint and exhaustive cells for any 


true distribution. (The degree of representativeness is then defined as 
dj* = 1 — k@*). 




















k=2 k=3 k=4 k=5 k = 10 
ae 0.05 0.10 {| 0.20 | 0.05 0.10 | 0.20; 0.05 0.10 |.0.20 | 0.05 0.10 | 0.20] 0.05 | 0.10 
0.50; 31 5] 2/102] 21 | 6/120) 26| 9/120) 304 5 | 100 | 20 
°0.60 | 60 5} 3] 141} 30] 6) 140] 388]; 9 | 140) 30] 5 | 100 | 20 
0.70 | 100 | 20] 31] 180] 47 | 12] 180 | 48 | 12) 180} 40] 5 | 120 | 30 
0.75 | 120 | 25) 3) 222) 51} 14 | 200] 52) 14 | 200 | 50 | 10 | 120 | 30 
0.80 | 151 | 35 | 9 | 240 | 60] 15 | 240] 604 14 | 220 | 50; 10 | 140 | 30 
0.85 ; 191 |} 45] 10 } 300} 72) 15 | 280 | 66] 16 | 240 | 60 | 15 | 160 | 30 
0.90 | 251 | 60! 13 | 360 | 90 | 21 | 320] 80 | 18 | 280) 70 | 15 | 160 | 40 
0.95 | 371 | 90 | 20 | 480 | 120 | 29 | 400 | 100 } 27 | 360 | 90 | 23 | 200 | 50 
0.99 | 651 | 160 | 39 | 741 | 180 | 45 | 600 | 146 | 38 | 500 | 120 | 35 | 260 | 60 














For k = 3 probabilities were computed exactly only for n S (200/k); for n > 
(200/k) the approximation in Appendix 2 was used together with a knowledge of 
the monotonicity pattern of the probability of representativeness as a function 
of n. 


TasBie IV 


Minimum sample size required to attain a probability of at least P* 
that a sample will be simultaneously representative to within a common 
allowance 8* of any two disjoint equi-probable cells defined by percen- 
tiles and having a common probability p under the true, unknown dis- 
tribution. (The degree of representativeness is then defined as d,* = 
1 — B*/p.) 



































Below 20th and Above Below 10th and Above Below 5th and Above 

Application 80th Percentiles 90th Percentiles 95th Percentiles 

(p = 0.20) (p = 0.10) (p = 0.05) 

p* B* 0.01 0.05 0.10 0.01 0.05 0.10 0.01 0.05 
0.50 1,700 52 10 900 20 If 450 ly 
0.60 2, 262 72 10 1,255 40 lt 600 lj 
0.70 3,000 112 20 1,655 54 lj 850 1t 
0.75 3, 500 132 30 1,955 60 1; | 1,000 1t 
0.80 4,100 152 30 2,300 80 1y | 1,150 lj 
0.85 4,900 180 40 2,700 100 10 1,400 1} 
0.90 6,000 232 50 3,355 120 20 1,750 lt 
0.95 7,900 300 70 4,455 160 35 2, 250 80 
0.99 12,562 492 120 7,000 274 65 3,650 130 

Another Between 30th and 50th | Between 40th and Between 45th 
Application percentiles and _ be- 50th percentiles and and 50th per- 
tween 50th and 70th between 50th and centiles and 
percentiles 60th percentiles between 50th 
and 55th per- 

centiles 








For n S 40 the entires are exact; for » > 40 normal approximation theory 
was used. 

{ Small entires for certain pairs (6*, P*) indicate a condition too weak for 
practical usage. 
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TABLE V 

Minimum sample size required to attain a probability of at least P* 
that a sample from a population of size N will be simultaneously repre- 
sentative to within a common allowance 8* of k equi-probable disjoint 
and exhaustive cells for any true population. (The degree of represen- 
tativeness is then defined as d,* = 1 — k@*). 

The four entries in each set below correspond to N = 60, 120, 360, ~, 
respectively. 





























k=2 k= 3 k=4 k=5 k = 10 
Oe 0.05 | 0.10 | 0.20] 0.05 | 0.10 | 0.20} 0.05 | 0.10 | 0.20} 0.05 | 0.10 | 0.20] 0.05 | 0.10 
0.50 | 20 5| 2} 40; 19] 6] 40; 20] 7] 40} 20; 34 34; 10 
20 5; 2] 55] 21) 6] 60] 20] 7} 60] 20] 5} 54] 15 
20 5] 2{ 81{ 21] 6] 80) 20| 7] 80] 24] 5| 74) 15 
31 5 | 2] 102; 21] 6/120] 26] 7] 120; 30; 5 | 100) 20 
0 75 | 40] 15] 3| 47} 28112| 47{ 26|12 | 45 | 27| 8 | 40 | 20 
60 | 20) 3} 76] 387/14] 74} 88; 12] 72; 30; 8] 60) 25 
91} 25; 3] 186 | 49 | 14] 1380; 40] 14; 120) 40] 10 | 94} 25 
120 | 25 | 3 | 222 | 51115 | 200 | 52 14 | 200 {| 50 | 10 | 120 | 30 
0.85 | 51] 25) 9} 53] 30] 14] 50] 32] 14] 49} 30; 10; 40] 20 
71 { 30|10| 84| 49,15); 80} 40114] 80} 40) 10) 60} 25 
120 | 40/10] 162; 60] 15 | 150] 58] 16 | 152; 50) 138 | 100 | 30 
191 | 45 | 10 | 300] 72] 15 | 280 | 66} 16 | 240/ 60, 15 | 160 | 30 








0.90] 51] 30/10; 54] 387/15] 50; 38); 16) 51) 30] 138 | 40) 25 
80! 40/13] 938] 51)19]) 90} 46) 16; 80] 40] 13] 74 | 25 
151} 50} 13} 180) 72} 21)170} 60) 18 | 160} 60) 15 | 114 } 35 
251 | 60 | 13 | 360 | 90 | 21 | 320; 80 20 | 280; 70 | 15 | 160 | 40 
0.95} 51] 35) 16} 54] 42) 21] 50) 38] 18 | 52) 37] 15] 47 | 25 
91| 50/19}; 94] 60] 25] 90}; 58) 20] 92} 50) 15] 74 | 30 
180 | 70 | 20 | 201 | 88 | 27 | 190 | 80 | 25 | 180; 70 | 18 | 120 | 40 
371 | 90 | 20 | 480 | 120 | 30 | 400 | 100 | 27 | 360 | 90 | 20 | 200 | 50 





0.99 | 60} 45] 23 | 55 | 48 | 27}; 57) 43) 25| 53; 40] 20 | 49 | 30 
100 | 70 | 30 | 102 | 72 | 30/100] 66] 29 | 98; 60/ 23 80} 40 
231 | 110 | 36 | 240 | 120 | 42 | 220 | 100 | 34 | 212) 90 | 25 | 154 | 50 
651 | 160 | 39 | 741 | 180 | 45 | 600 | 146 | 37 | 500 | 120 | 30 | 260 | 60 























For finite populations all entries withn S 2/8* are based on exact computations ; 
the entries with n > 2/8* are based on the approximation in equation (A17) of 
Appendix II. Another simpler approximation is given in equation (A18) of Ap- 
pendix II 


TaBLE VI 


Comparison between the exact value of and the normal approximation 
to the joint probability that in a sample of size n from an infinite popu- 
lation the number of observations falling in each of two tails with com- 
mon probability p is between n(p — B*) and n(p + £6*), inclusive. 























= 0.10 = 0.20 
p* = 0.05 B* = 0.05 B 

n = 10 Normal Approx. 0.1628 0.0973 
Exact 0.1510 0.0941 

Error +0.0118 +0.0032 

n = 20 Normal Approx. 0.5482 0.3654 
Exact 0.5566 0.3648 

Error —0.0134 +0.0006 

n = 40 Normal Approx. 0.6608 0.4655 
Exact 0.6731 0.4669 

Error —0.0123 —0.0014 

Taste VII 


Comparison between the exact value of and the normal approximation 
to the joint probability that in a sample of size n from a population of 
size N the number of observations falling in each of k equi-probable cells 


i i 1 1 
is between n(¢ - i) and n(¢ + — 


k 


20 


) , inclusive. 


N= 0 (Infinite Population) 





















































k=2 k=3 k=4 k=5 k = 10 
n = 20 | Normal Approx. 0.4977 0.1166 0.1600 0.1172 0.0698 
Exact 0.4966 0.1145 0.1618 0.0955 0.0669 
Error +0.0011 | +0.0021 | —0.0018 | +0.0217 | +0.0029 
n = 40 | Normal Approx. 0.5708 0.2196 0.2388 0.1962 0.1775 
Exact 0.5704 0.2181 0.2363 0.1904 0.1478 
Error +0.0004 | +0.0015 | +0.0025 | +0.0058 | +0.0297 
n = 60 | Normal Approx. 0.6338 0.3974 0.3230 0.2876 0.3325 
Exact 0.6338 0.3982 0.3174 0.2979 = 
Error 0.0000 | —0.0008 | +0.0056 | —0.0103 ‘3 
N = 120 (Finite Population) 
k=2 k=3 k=4 k=5 k = 10 
n = 20 | Normal Approx. 0.5357 0.1397 0.1984 0.1550 0.1092 
Exact 0.5368 0.1359 0.1801 0.1547 0.1011 
Error —0.0011 | +0.0038 | +0.0183 | +0.0003 | +0.0081 
n = 40 | Normal Approx. 0.6651 0.2822 0.3705 0.3413 0.4291 
Exact 0.6670 0.3084 0.3679 0.3313 0.3357 
Error —0.0019 | —0.0262 } +0.0026 | +0.0100 | +0.0934 
n = 60 | Normal Approx. 0.7969 0.6338 0.6115 0.6228 0.8507 
Exact 0.7989 0.6104 0.6003 0.5972 ? 
Error —0.0020 | +0.0234 |; +0.0112 | +0.0256 . 


* Not computed. 
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F7-1(Q) F102) F710.4) F71(0.6) F71(0.8) ; F-1(1,0) 


Fig. 1 — Pictorial diagram of representativeness using deciles (k = 10). 
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Fig. 2 — Minimum sample size n required to attain a probability of at least P* 
that a sample is simultaneously representative to within a common allowance 6* 
of two disjoint and exhaustive cells each having probability p = 44 under the true 
unknown distribution. (The degree of representativeness is dj* = 1 — 26*.) 
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Fig. 3 — Minimum sample size n required to attain a probability of at least 
P* that a sample is simultaneously representative to within a common allowance 
6* of the two disjoint, exhaustive cells separated by the 10th (or the 90th) per- 
centile for any true distribution. [The degree of representativeness is dj* = 


(3) V@.1 — B*) (0.9 — 6*).] 
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Fig. 4 — Minimum sample size n required to attain a probability of at least P* 
that a sample is simultaneously representative to within a common allowance 6* 
of the two disjoint, exhaustive cells separated by the 20th (or the 80th) per- 
centile for any true distribution. [The degree of representativeness is d,* = (8) 


V0.2 — B*) 0.8 — B*),] 
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V. EMPIRICALLY OBSERVED MONOTONICITIES 


It is interesting to note in Table III that for fixed 6* and increasing 
k the sample size n required is not monotonic but appears to reach a 
maximum and then decrease. As a result of this it becomes possible to 
to speak of the sample size n required for a sample to be representative 
for any specified 6* regardless of the number k of pairwise disjoint, ex- 
haustive, equi-probable cells considered, provided only that k < 1/8*. 
For example, for 6* = 0.1 it appears likely from Table JII that 90 ob- 
servations would be sufficient to have a confidence of at least P* = 0.90 
that the sample is representative in the sense of (2) for any one value of 
Ae =. 1; 2, +2%, 10). 

Table VIII, some of whose entries are taken from Table III, shows 
numerically that for fixed d,* the required sample size is a monotonically 
non-decreasing function not only of P* but also of k; for fixed B*. Table III 
shows numerically that only the monotonicity with P* holds. The former 
result is again shown in Figs. 5 and 6 which also emphasize the possi- 
bilities of interpolation on k. 

The above monotonicities and lack of monotonicities have not been 
demonstrated mathematically. 


Tasie VIII 


Minimum sample size required to attain a probability of at least P* that 
a sample will be simultaneously representative to a degree d,* = 1 — kg* 
of k equi-probable disjoint and exhaustive cells for any true distribu- 
tion. 




















d,* = 0.80 dg* = 0.90 

P* 

k=2 k=4 k = 10 k=2 k=5 k = 10 
0.50 5 120 600 31 800 2500 
0.60 5 140 700 60 950 2800 
0.70 20 180 800 100 1150 3200 
0.75 25 200 850 120 1250 3400 
0.80 35 240 900 151 1400 3700 
0.85 45 280 1000 191 1600 4000 
0.90 60 320 1100 251 1850 4400 
0.95 90 400 1250 371 2250 5100 
0.99 160 600 1650 651 3150 6600 





In comparing results for a fized degree d,* it should be noted that the sample 
size appears to be a monotonically non-decreasing function of P* and also of k; 
for a fized common allowance f* only the monotonicity with P* holds as is evident 
in Table II. The remarks at the bottom of Table III apply here also. 


148 


THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1958 


VI. CONFIDENCE BANDS—INFINITE POPULATION CASE 


The experimenter will usually be interested in the confidence state- 
ment that the above formulation allows him to make after the observa- 
tions are taken. Suppose, for example, that he was interested in representa- 
tiveness in each of k = 10 pairwise disjoint, exhaustive and equi-probable 
cells and that he specified 6* = 0.02 (so that d,* = 0.80) and P* = 0.85 
and that he has taken 1,000 observations in accordance with Table VIII. 


















































Fig. 5 — Minimum sample size n required 
to attain a probability of at least P* that a 
sample will be simultaneously representa- 
tive to a degree d,* = 0.90 of k equi-proba- 
ble, disjoint and exhaustive cells for any 
true distribution. The common allowance 
6* is given by B* = (1 — d,*)/k = 0.10/k. 







































































Fic. 6 — Minimum sample sizen required 
to attain a probability of at least P* that a 
sample will be simultaneously representa- 
tive toa degree d,* = 0.80 of & equi-proba- 
ble, disjoint and exhaustive cells for any 
true distribution. The common allowance 
B* is given by B* = (1 — d,*)/k = 0.20/k. 
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He can then make a number of confidence statements about the popula- 
tion deciles F~'(0.1), F~'(0.2), ---, F~'(0.9) (and also about F~'(0) 
and F'(1) defined as the greatest lower bound of all x for which F(x) > 0 
and the least upper bound of all x for which F(x) < 1, respectively). 
For example, if x, denotes the mth (smallest) ordered observation, it 
follows from the condition of representativeness that we have szmul- 
taneously with joint confidence greater than P* all of the inequalities 


~o <F'0) <2 two <F (1) S & 

X20 < F™ (0.1) < X01 ts << F*(0.9) < X920 

Yio S F*(0.2) < Xoar X90 << F"(0.8) <= Zs10 

t aiah deideattntancs Seales Ae 2 Gnd A eho eeaereeie Gest. 116) 
too S F (0.8) < X61 wi =< F~*(0.2) <= Xse0 

tm <F"(0.9) < « —0 <F'(0.1) S a 

L000 SS F“(1) So OS Ss F™“(0) es 


For example, F*(0.2) must be greater than or equal to 216 and less than 
Xo In the confidence statement since under the condition of represen- 
tativeness all cells and, in particular, the last two cells on the left con- 
tain between 80 and 120 observations, inclusive. 

The right hand set of inequalities are in reverse order since they are ob- 
tained by similar reasoning as the left hand set except that we start at 
the right end of the distribution and work backwards. If we keep only 
the stronger results in (6) for each decile and disregard the weaker ones, 
then we obtain eleven (finite or infinite) line segments as in Fig. 7. We 
can then state with joint confidence greater than P* that the unknown 
distribution F has a (finite or infinite) point of contact with (or a saltus 
passing through) each of the line segments; the two end segments are 


PROBABILITY SCALE 





Ogden ies 
1 Xa a9 Xio00 


xy Li24 Xo4y L361 Lge, = Keor = Leet L7E1 Lear Loa 


Fig. 7 — Confidence intervals for the deciles with joint confidence level P* = 
0.85 for k = 10, nm = 1000 and g* = 0.02 (which implies that d,* = 0.80). 
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actually half-lines and in these cases we must allow + © and —© as 
possible ‘‘points”’ of contact. 

The above result then gives rise to two “staircases”, as in the middle 
diagram of Fig. 8, such that any distribution contacting every line seg- 
ment in lig. 7 must everywhere lie between (or on the boundary of) the 
two “staircases”. Hence we can state with confidence greater than P* 
(see explanation below) that the two “staircases” form a confidence 
band on the unknown distribution. 

If we keep k and P* fixed and decrease §* (or increase d,* = 1 — k§*) 


1.00 


PROBABILITY SCALE 





L25 L490 X73 Xg7 Lyay Ly2q Lizz L445 L153 


1.00 
0.80 
0.60 
0.40 


0.20 


PROBABILITY SCALE 





Lieo Lao L400 | Xs20 Leao | Lre0 | L980 
Lr24 L244 L361 Laat Leo4 Lear = L761 Leas Loa4 


1.00 
0.80 
0,60 
0.40 


0.20 


PROBABILITY SCALE 


Lr20 Lieo0 [C2240 L3120 L3560 
Laat = Xggr L321 i793 L2201 Laser L2921 3281 C3641 





Fig. 8 — Confidence bands which include the true distribution function with 
confidence greater than P* = 0.85 for k = 10 and d,* = 0.5, 0.8, 0.9. Small circles 
between the confidence bands represent ordinates of the sample distribution 
function. The three figures above were constructed with observations obtained 
from a table of random normal deviates (with different horizontal scaling applied 
in each case). 
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then the required sample size increases and the confidence band becomes 
narrower. This is illustrated in the three diagrams of Fig. 8. 

It should be noted that the inequalities (6) are implied by but do not 
imply (i.e., they are not equivalent to) the condition of representative- 
ness. Hence the confidence level associated with (6) is greater than the 
specified P*. To illustrate this we note from (6) the stronger inequalities 


X30 < F(0.1) < X01 and 160 < I *(0.2) <i Wo4 . (7) 


These inequalities (7) allow as few as 40 and as many as 161 observations 
between J *(0.1) and F*(0.2), including endpoints. On the other hand 
we have confidence P*, under the condition of representativeness, that 
every such cell contains between 80 and 120 observations, inclusive. This 
shows that the confidence level associated with the confidence band is 
greater than the probability achieved for the representativeness of the 
sample. 

This method of obtaining a confidence band for the unknown dis- 
tribution would be more valuable if we could obtain a simple way of 
computing (or estimating more accurately) the actual confidence level 
attained. For example, with k = 3, 6* = 0.10 (so that d,* = 0.70) 
and P* = 0.60 we obtain n = 30 from Table ITI, the probability achieved 
for representativeness is 0.6369 and the confidence level associated with 
the two “staircases” is 0.6825. The latter is obtained by using inequali- 
ties similar to (6) and computing the probability exactly with a multi- 
nomial distribution. The reader should note that the idea of a confidence 
band containing the true, unknown distribution is not the main theme 
of this paper but only an interesting by-product of the idea of the repre- 
sentativeness of the sample. 


APPENDIX I 
Exact Formulae — Finite and Infinite Populations 


The concept of the representativeness of a sample can be applied to 
finite as well as infinite populations. Let N denote the total size of a 
finite population; conceptually we may regard the population as being 
partitioned into k subsets S; of size F(S;)(¢ = 1, 2, ---, k). We shall 
assume that the sets S; are pairwise disjoint and, to simplify the discus- 
sion, we also assume that the quantities VN; = NF(S,)(@i = 1, 2,--- , k) 
are positive integers. 

Let x; = O denote the random integral number of observations in the 


observed sample of size n which fall in the set S;(2 = 1, 2, ---, k). If 
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the k sets S; are exhaustive then 


Siat =n and '41N; = N. (Al) 
We define for z = 1, 2, ---,k 
c; = n[F(S;) — 8*] and d; = n[F(S;) + 8", (A2) 


which are non-negative but need not be integers. Then for a finite popu- 
lation the probability corresponding to the left number of (5), using the 
hypergeometric distribution, is given exactly by 


PIN; , a: , 0: @ = 1,2, -°-,B] = Si /() (A3) 


n 


where 2) is the usual binomial coefficient and the summation in (A3) 


is over all vectors X = {21, 2%, ---, a} for which . 
S24: d; (4 = 1,2, +++, k). (A4) 


If the k& sets are not exhaustive then we define another set S;4:1 which is 
the complement of the union of the k sets S; and use (A8) with k replaced 
by & + 1 in (Al) and (A8) but not in (A4), ie., no condition is applied 
to the (k + 1)th variable. 

In the case of an infinite population we use the multinomial distribu- 
tion. If the & sets S; are exhaustive, then using (A2) and letting p; = 
F(S;)(i = 1, 2, --+ , k) the left hand member of (5) is given exactly by 


! k 
Pps, Bi* 6 = 1,2,-+-,6)1 = Se —T1@2) (8) 


II (as!) 


t=1 





where the summation is again over all vectors xX = {a1, 22, --+, 2} 
satisfying (Al) and (A4). If the & sets are not exhaustive then we define 
Sx41 as above and the same expression (A5) is obtained with & replaced 
by k + 1 in (A1) and (A5) but no# in (A4), 1Le., no condition is applied 
to the (k + 1)th variable. 

It is interesting to note that the results for the infinite case (V = «) 
can be obtained from those of the finite case by letting N tend to in- 
finity. Table V illustrates this numerically since the four entries in each 
set correspond to N = 60, 120, 360 and o, respectively. 


APPENDIX II 


Approximate Solutions — Infinite and Finite Populations 


Let 2; denote the random integral number of observations in a sample 
of size m which fall in the 7th cell (@ = 1, 2, ---, k). If we let 
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eae k 
yi; = x; — (n/k), then the two conditions >\i-1 2; = n and 


dint Yi = 0 (A6) 


are equivalent. Let [x] denote the largest integer not greater than x. 
We shall consider only the case of the equi-probable exhaustive sets. 
In the case of an infinite population we wish to compute 


p= P\n(— pt) sm sn(t+ at) 


@ = 1,2,---,k) 


(A7) 





i=1 
If we introduce a continuity correction and use (A6) then we obtain 


P = P{-b; S y; S$ ai = 1,2, ---, BE) |DoEiy: = 0} (A8) 


where for each 7(2 = 1, 2, --- , k) 


1 n n | a. 0 n 
a= h+|nar+2|—2 and b= 5 +| not t| +¢. (A9) 


k 
If n/k is an integer and 8* is the common value of 6,*(¢ = 1, 2, --+ , k) 
then a1 = @2 = -°+ = a = bf, = bb = +++ = be = a (say) and (A8) 
reduces to 


P=P{|y| Sa@=1,2,---,b)|Diays = 0} (A110) 


where a = 34 + [n@*]. 

To compute (A10) two approximations are made. The k-variate multi- 
nomial probability is first transformed by an orthogonal transformation 
into a (k — 1)-variate distribution with homoscedastic and uncorrelated 
variables and the first approximation is to replace the latter distribution 
by a multivariate normal distribution with independent variables. The 
region of integration is the intersection of the hypercube |y;| S a 
centered at the origin with edge-length 2a and the hyperplane (A6); 
the orthogonal transformation merely rotates this intersection about the 
origin. These intersections are convex figures symmetric with respect to 
the origin; for example, it is a regular centered hexagon for k = 3. These 
intersections, called Stott figures, are discussed in Appendix III. The 
second approximation made in computing (A10) was to replace the 
Stott figure by a (k — 1)-dimensional central sphere whose radius R 
is determined by equating the two hypervolumes. Values of R for k = 
2(1)12 for any a are given in Table IX. 
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Taste [IX 


Intersection J of the hypercube of edge-length 2a centered at the origin 
and the hyperplane 2 + v2 + --: + 2% = 0. 











Dimension k of J(k) = Number of equally large Radius R of sphere with content 
hypercube simplices in g equal to that of g 
2 1 1.4142 a 
3 6 1.2861 a 
4 4 1.3655 a 
5 230 1.4486 a 
6 66 1.5225 a 
7 23,548 1.5995 a 
8 2,416 1.6733 a 
9 4,675,014 1.7443 a 
10 156, 190 1.8126 a 
11 1, 527,092, 468 1.8786 a 
12 15,724, 248 1.9422 a 





The content I(k) of g for all k is given by 


akla/k 
(k — 1)! 





I(k) = a) — He — Der Dk — yet — - 


where the terms continue only as long as the arguments k, k — 2, --- are positive. 
The radius R of a (k — 1)-dimensional sphere of equal content is obtained by 


x k+1 
equating 7(k) and ceva /'v en . 


The orthogonal transformation referred to above is 


, 1 : 
wl = aay nt ye boo Foye tia) (A11) 





G = 1,2, «++, %) 
where 7,41 is defined to be identically zero. Then y,’ is identically zero 
by (A6). The remaining y,’ all have a common variance ; since for each 
ia = 1,2,---,k —1) 


2 1 ose k- 1 
od see e+ Om (EE) 


t n of nm\| _n 
+2G)(-3)-2(-B)} oF 


and are pairwise uncorrelated since for z < 7 


(A12) 
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; : 1 ni(k — 1) 
Puiu’ — Oulu’ ~ G+ DIGED\ # 


+2 (5)(- *.) a) (- *.) 4G =i): (R13) 
(-%) -MEEY + —w(-B)}=0 


If we let » = k — 1, let r = R/o = Rv/k/n and let S denote the 
central sphere of radius r then the approximate probability (dropping 
primes) is given by 


if v/2 1 y r 
pe fo [(s) exp{— 3D w'} dn dy - dy, (A14) 


= Pp {x0" = r’} 








where x,’ denotes a chi-square random variable with v degrees of freedom. 
In the case of a finite population of size N the only change in the above 
discussion is to replace (A12) by 


n{N — i 
ee = 2 (q — ") (= 1,2,---,/ -—1) (A15) 
thus increasing the value of r’ and the value of P; this decreases n if P 
is held fixed at any P*. If we let ny and n., denote the required values 
for a finite population of size N and an infinite population, respectively, 
for the same fixed k, 6* and P* then we obtain from (A14) and (A15) 


Ne & A), (A16) 





or, taking the smaller solution in ny , we have for large NV 


~ VN — VN? — AN — 1)n, 
5 . (A17) 





rn = 


Replacing N — 1 by N in (A16) we easily obtain for large N the simpler 
result 


—2—-_. (A18) 


The error in P involved in both of the above approximations (A14) 
and (A17) is evaluated in Table VII for N = 120 and N = o for se- 
lected values of n, 6* and k. 

If n/k is not an integer then the above discussion may not apply since 
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~ a; may not equal 0; in (A9). Assuming again a common (* then we have 
a common “a” and a common “b” in (AQ). In this case, averaging the 
approximate probabilities obtained by using 2a and 2b alternately as 
the edge-length of the hypercube was found to be satisfactory for com- 
puting the tables of this paper. 


APPENDIX III 
Geometric Results and Eulerian (Diamond) Numbers 


The problem here is to find the (& — 1)-dimensional content (or hyper- 
volume) of the intersection J of the centered k-dimensional hypercube 
ly:| < a(t = 1, 2, ---, k) and the (&k — 1)-dimensional hyperplane 
Yi t+ Yo +--+ + ye = 0. The geometry for even & and odd k is quite 
different. The number of vertices of g for even k and odd k, respectively, 


is 
co and Caen oe (A19) 


for example, for k = 3 we obtain the 3 (7) = 6 vertices (a, —a, 0), 


(—a, a, 0), (a, 0, —a), (—a, 0, a), (0, a, — a) and (0, —a, a). The vertices 
are all equally distant from the origin. All the edges of 9 have a common 
length d = d(k) which equals 2a+/2 for even k and av/2 for odd k. The 
intersection J is a convex figure which is symmetric with respect to the 
origin and is known as a Stott figure.” The Stott figure can be parti- 
tioned into an integral number J(k) of (& — 1)-dimensional simplices 
which are not necessarily regular but are such that each simplex has the 
same content as a regular (k — 1)-dimensional simplex with edge- 
length d. Hence, using a result on page 125 of Reference 8, the content 
I(k) of J is given by 


. (VV Vk | 
I(k) = (x?) eit. (A20) 


The integers J(k) are given in the middle column of Table IX; for ex- 
ample, the integer 6 for k = 3 indicates that there are six equilateral 
triangles in the centered hexagon. 

D. Slepian’ has shown that for even k the integers J(k) can be found 
by generating a “triangle” of numbers using the recurrence relation 


Sig = fSen.3 + tSip4 G7 = 1,2,+--) (A271) 


with boundary conditions S;,; = S;, = 1 for all 7; then the desired 
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quantities are 
Si: = J(22) (¢@ = 1,2,---). (A22) 
Similarly for odd & he showed that we can use the recurrence relation 
Tog = QF + DT i413 4+ Qt4+ YTip4 G7 = 1,2,+--) (A23) 


with boundary conditions 7,; = 7; = 1 for all 7; then the desired 
quantities are 


T,<= J(@%+1) G@=1,2,---). (A24) 


Fig. 9 shows these numbers in two diamond-shaped patterns and ex- 
plains another interesting way of obtaining these numbers. 


”» A 
Aver Ahr 
Appr APS 


prerprpr  proroho 
gepysed duds yd 


1682 


191 2416 Nel es 23,548 10, te 
15,619 259,723 259,723 
15,619 A e K 
156,190 4,675,014 
/N\ /N\ 
Fig. 9 — Combinatoric derivation of certain Eulerian (diamond) numbers. 


The number at any vertex V is obtained by considering any one path from the top 
vertex to V, multiplying the circled numbers encountered in this path, and sum- 
ming the results obtained over all possible downward paths from the top vertex 
to V. In particular, the values on the vertical diagonal (of the diamond) are the 
values of J(k) in Table IX. It is interesting to note that the sum of all the un- 
circled numbers in the mth row is 2”~! (m — 1)! for the odd case and m! for the even 
case. This is shown above for m = 1, 2, 3, 4, 5 and would hold for all m if this 
pattern were continued indefinitely. The circled numbers are obtained by num- 
bering the parallel diagonal lines starting with one at the “‘top,’’ using all pos- 
itive integers in the even case and only odd integers in the odd case. 
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The integers J(k) arise in connection with combinatorial problems. 
As an example for even k, suppose we draw at random m balls in suc- 
cession from an urn containing m balls marked 1, 2, --- , m. Let X de- 
note the number of times that the observed number increases, (say) 
always counting the first draw as an increase. Then it can be shown that 


Bix = j} = Sj,m41—;/m! @] = 1, 2, ae m), (A25) 


ie., the mth row of the left diamond Fig. 9 divided by the sum m! of 
that row gives the elementary probability distribution of X. 

The problem of computing (A25) also arose in the work of V. H. 
Moore and W. A. Wallist and M. MacMahon? who referred to it as 
Simon Newcomb’s problem. J. Riordan> has studied the numbers J(k) 
for even k and Carlitz and Riordan® call them Eulerian numbers (to 
be distinguished from the classical Euler numbers); an explicit formula 
as well as a generating function appears in these papers. The S;,; are 
related to the Eulerian numbers A, (defined in Reference 5) by S;,; = 
Aisj-t, j- 

Explicit expressions for J(k) for odd and even k are obtainable from 
(A22), (A24) and the more general results 


Sis= DT (-DCG = (A26) 
T:3 = > (y(t) pg — o) + 1% (A27) 


due to D. Slepian.’ It is easily shown that these formulae satisfy the 
corresponding recurrence relations as well as the boundary conditions. 
By an induction and symmetry argument applied to (A21) and (A23) 
and the boundary conditions it is easy to prove that 


Sij = S3,: and Ts = T 5. . (A28) 
Substituting (A26) and (A27) in (A28) gives rise to interesting, non- 


trivial identities. For completeness we also give the generating functions 
derived by D. Slepian’ 








~ S: jt'u? tu(e’ ja e") 
Gace ei ieee A29 
Sijt'u’ | Foe tay 
sy eee 
t,j=1 (2 + q)! 08 te“ — uet (A30) 
Poe, td = ttu 


Mo it jt te — wer’ 
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The final result for the content J(k) of J can, using the above be. 
written as a single expression 


a [(k—-1) /2] 
I(k) = a ms p> (—1)* @ (k — 2a)" — (A32) 


for all k where [xz] denotes the largest integer not greater than x. It has 
been pointed out by J. W. Tukey that (A382) can also be obtained by proba- 
bilistic considerations and that it appears in Laplace’s “Theorie Ana- 
lytique”’ (Book 2, page 260). 


APPENDIX IV 
Remarks on the Confidence Bands 


It should be remarked that other assumptions on the true, unknown 
distribution can be used in conjunction with the confidence bands ob- 
tained in Section VI. It has been pointed out by J. W. Tukey, for example, 
that in the case of the first diagram in Fig. 8 the experimenter might be 
willing to assume that the true distribution is unimodal and that the 
mode zm is such that am < ae. Then on purely geometrical considera- 
tions it can be shown that the confidence band can be modified as shown 
in the first diagram of Tig. 10. Briefly, if the true distribution enters any 
one of the three deleted triangles with any slope s then in order to get 
out again without leaving the confidence band the slope must get larger 
than s. But this contradicts the assumption that the’ density steadily 
decreases after 2e . 

Similarly, with the same problem, if the experimenter assumes that 
the true distribution is unimodal and that 273 S tm S 2s then the first 
diagram of J’ig. 8 can be modified as in the second diagram of Fig. 10. 
The assumption of unimodality is reasonable in many different practical 
applications but has not often been utilized in statistical techniques. 

It is possible to formulate a problem for fixed P* and n which requires 
the determination of that 4 which makes the maximum (or some average) 
vertical width of the confidence bands as small as possible. For example, 
for P* = 0.85 and m = 240 the value & = 10 minimizes the maximum 
vertical width. It should be pointed out that if the experimenter’s prin- 
cipal interest is in finding confidence bands with small vertical widths 
then this procedure appears to be quite inefficient compared with that 
based on the Kolmogorov statistic.’ 

A proper comparison is difficult since the nominal P* is a lower bound 
and not the correct value of the confidence level associated with the pro- 
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Fig. 10 — Modified confidence bands which include the true distribution func- 
tion with confidence greater than P* = 0.85 for k = 10 and d,* = 0.5. 


posed confidence bands. As mentioned in the body of the paper the de- 
velopment of a confidence band is just a by-product of the main theme 
of this paper which is the representativeness of the sample. 


VII. CONCLUSION 


Definitions of representativness and of degree of representativeness are 
given and tables are included which give the sample size required to 
guarantee with preassigned probability P* that a random sample will 
satisfy a condition of representativeness, the definition of which is 
agreed upon in advance. Thus, for experimenters who wish to know in 
advance how many observations will be needed for a distribution study, 
the problem has been given a precise nonparametric formulation and the 
solution has been found for some cases. 

This formulation also leads to confidence bounds on the unknown 
distribution after the observations are taken. POD: are given to illus- 
trate this. 

The tables for the case of pairwise disjoint, equi-probable and exhaus- 
tive cells may also prove to be useful for the problem of determining the 
sample size required to obtain s¢multaneous confidence limits (on a 
preassigned level P*) for all of the cell probabilities of a multinomial 
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distribution. Further investigation is needed to state precisely the con- 
ditions under which these tables can be used for this related problem. 
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Fluctuations of Random Noise Power* 


By D. SLEPIAN 


(Manuscript received September 4, 1957) 


The probability distribution of the power, y, of a sample of Gaussian 
noise of time duration T is considered. Some general theory is presented 
along with curves for the cumulative distribution and probability density of 
y for several different power spectra and values of T. 


I. INTRODUCTION 


A random quantity of interest in many communication and detection 
systems is the average power, 


1 T/2 
y=7f{ iO dt, (1) 
T J—ryj2 


of a sample of finite time duration, 7’, of a Gaussian noise, V(t). This 
quantity has been discussed in some detail by Rice in his classic paper’ 
where he obtains expressions for the first few moments of y and an ap- 
proximate probability density function. 

In this paper the exact probability density function, f(y), and the 
cumulative distribution function, F(y), of the average power are com- 
puted for a number of ergodic Gaussian noises and for a number of 
values of T. The results are presented as a series of curves which are dis- 
cussed in the next section. It is hoped that they will be of use to those 
designing specific systems. 


II. SUMMARY OF COMPUTATIONAL RESULTS 


Tig. 1 shows the probability density function, f(y), for the random 
variable y of equation (1) when N(¢) has mean zero and power spectrum 


2a 
a? + Ag?f?? 


Noise with this spectrum will be referred to as RC noise (see 5.1). 


w(f) = 


Ses fF Sp 


* The research reported here was supported in part by the Office of Naval 
Research under contract Nonr 210(00). 
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Fig. 1 — Probability density, f(y), for RC noise. 


The curves are labelled by values of 6 = aT7'/2. The curve marked 
8 = Ois the probability density function for y = N?(t). Fig. 2 shows the 
corresponding cumulative distribution functions, F(y). 

For any 8 > O, as y approaches zero, f(y) and F(y) approach zero 
more rapidly than any power of y. 

As 8 becomes large, the density function f(y) peaks up around unity 
which is the average power of N(t). The variance of y is given by 
(28) [48 — 1 + e**|. It approaches zero for large 8 like B™’. 

Figs. 3, 4 and 5 show f(y) when N(¢) has mean zero and power spec- 
trum 


2 


_ 2Q aA nt sy 
w(f) Wo 4 (2) (ot — we)? ©) 


w= 2nf, -~ SfSo. 


Noise with this spectrum will be referred to as RLC Noise (see 5.2). 
The figures are respectively for the cases Q = 1, 10 and 100. The curves 
are labelled by values of s = woJ’. The curves marked s = 0 are the 
density function for y = N?(t). The corresponding cumulative density 
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Fig. 2 — Cumulative distribution, f(y), for RC noise. 


functions, ’(y), are shown on Figs. 6, 7 and 8. The spectra for Q = 
10 and 100 are plotted on Fig. 9. 

For any s > 0 and for any finite Q > 0, as y approuthes zero, both 
f(y) and F(y) approach zero more rapidly than any power of y. 

For any fixed Q, as s becomes large, the density function f(y) peaks 
up around unity which is the average power of N(é). The variance of y 
is given by 


2 
oc = 


oo ie) 


ae >) a 


T= 


6 


For fixed Q, it approaches zero for large s like 2Q/s. 

If, however, s = wo7' is held fixed and Q is permitted to increase, Figs. 
3, 4 and 5 show that f(y) becomes less concentrated; that is, with 
fixed integration time and fixed resonant frequency, fluctuations in 
power become more pronounced as the relative width of the spectral 
peak is decreased. Indeed, one has 


Aer 
sin § 





limo =1+ 


? 
Q->00 s? 


so that 
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lim lim o” = 1, 


s>0 Q>00 
whereas, as already noted, 


lim lim o” = 0. 
Q>00 s->0o 
In the limit Q = «, the Gaussian noise can be taken to be the single 
frequency ensemble N(t) = a cos wot + D sin wot, where a and b are in- 
dependent normal variates with mean zero and variance unity. The 
density for y in this case is 


f(y) = sec ge’ *°”? Jo(ty tan ¢ sec ¢) 


where sin g = sin s/s and Jp is the usual Bessel function (see Appendix 
1). This density is plotted for several values of s in Fig. 10. It is to be 
noted that this limiting noise, although stationary, is not ergodic. It is 
this fact that causes the variance of y to be bounded away from zero as 
s — ©, Quite generally, if N(¢) has a purely continuous spectrum, the 
variance of y will approach zero as the integration time becomes infinite. 
If the spectrum of N(¢) has line components, this will not be the case. 
It is not difficult to give a qualitative argument as to why power fluc- 
tuations in a fixed time interval increase as the power spectrum becomes 
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Fig. 3 — Probability density, f(y), for RLC noise, Q = 1.0, s = woT’. 
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Fig. 5 — Probability density, f(y), for RLC noise, Q = 100, s = woT’. 
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Fig. 6 — Cumulative distribution, ’(y), for RLC noise, Q = 1.0, s = woT’. 


more peaked. Noise with the power spectrum (2) can be thought of as 
the noise voltage produced across the resistor in a series RLC circuit 
when the applied voltage to the circuit is white Gaussian noise. The 
larger the Q of the circuit, the more it tends to “ring’’ in response to an 
impulse input; i.e., the longer the transients persist. An atypical excur- 
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Fig. 7 — Cumulative distribution, F(y), for RLC noise, Q = 10, s = woT’. 
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Fig. 8 — Cumulative distribution, F(y), for RLC noise, Q = 100, s = wo7’. 


sion of the input voltage will therefore have a longer lasting effect in the 
output of a circuit with a large Q than in a circuit with a small Q. To 
obtain the same variance in power, then, the integration time must be 
longer in the circuit with the large Q value. It would seem reasonable to 
expect this argument to apply for any peaked spectrum, not solely for 
Qe 

If the Q of the spectrum (2) is increased, how much must the integra- 
tion time be increased to maintain roughly the same power fluctuations? 
From (3), it is seen that for large Q, o” is approximately 27°[r — 1 + e’], 
i.e., a function of 


T 


T 


Olé 


=e 
Q 


alone. Now Q measures the relative sharpness of the spectral peak, so 
that wo/Q is a measure of the absolute width of the peak in radians/sec. 
As a rough rule, then, power measurements from different members of 
the family (2) will have the same fluctuations if their products “‘integra- 
tion time” times “absolute spectral bandwidth” are the same. Fig. 11 
shows o’ as a function of r for Q = 1, 10, and 100. That 7 isa good meas- 
ure of the fluctuation in power can also be seen by comparing the f 
curves of equal 7 value in Figs. 3, 4 and 5. They are almost identical. 

* It seems to be very difficult to make any other qualitative statements re- 


garding the relation between the shape of the noise spectrum and the density 
function for y. 
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On Fig. 11 the variance of y for bandpass noise with spectrum 


1 
w(f) = 148’ Vices fo] 


0, l[ftfo|>s 


IA 


6 


is plotted versus 







































































Fig. 9 — RLC spectra, Q = 1, 10, 100. 


FLUCTUATIONS OF RANDOM NOISE POWER 171 


















































2.8 
“| + | 
2:0 + | 
— 2 
S = 277/10 f(y) =sEecpe acre Jo(LYy TAN p SEC ¢) 
1.6 SIN~ = ile 
f(y) Z S = WoT 
1.27-\ T a | a 
eTT/§ 
0.8 t t 
0.4 | ie 
| 
0 ee ee 
(o) 0.4 0.8 1.2 1.6 2.0 2.4 2.8 3.2 3.6 4.0 
Y 


Fig. 10 — Probability density, f(y), for RLC noise, Q = ~. 


and measures the relative width of the spectrum. This definition of Q, 
causes the o” curves of this noise power to agree asymptotically with 
those of the RLC noise power; namely o? ~ 2/7 in both cases. Again, 
when it is not too small, 7 seems to be a good measure of power fluctua- 
tions. The variance in this bandpass case is given by 


2 
1 | c0s Qorysin | 
=a | (1 — y) |; —————— ; dy 
| ary | . 
2 


which can be readily evaluated in terms of Si and Ci functions. The curve 
for Q» = 100 coincides so closely with the curve for Q = 10 it could not 
be shown on Fig. 11. 

The asymptotic agreement of the variance of noise power for band- 
pass and RLC noise permitted defining the Q of the bandpass circuit as 
Q. = m(fo/26). These same considerations suggest defining the band- 
width W of the RLC spectrum by W = a/2Q. For, in the bandpass 
case, r = 2(26)T which is 2T times the bandwidth of the spectrum. For 
the RLC noise, 7 = w7’/Q = 27 (wo/2Q), whence the definition of W 
follows. 
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Fig. 11 — o? for RLC noise and bandpass noise, Q = 1, 10, 100. 


The curves shown in Figs. 1-8 are believed to be accurate to two sig- 
nificant figures. For comparison, some points computed from Rice’s 
approximate formula for f(y) (equation 3.9-20 of [Ref. 1}) are shown on 
Fig. 3. Rice’s formula is seen to fit the tails of f(y) well for large y, but 
the central portion of the distribution is given accurately only for large 
values of r. However, the approximate cumulative distribution obtained 
by integrating Rice’s formula agrees quite well with F(y) for a wide 
range of 7 values as is seen in Fig. 6. 
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The approximation in question assumes a x? type distribution 
(n/2)—1 —(y/2q?) 

: 1 € 

fy) = 7 __., 
esr (5) 


The parameters g and n are chosen to make the first two moments of 
this density agree with the true first two moments of y. That is, for the 
normalization Hy = 1 adopted here, the equations gn = 1 and 2nq‘ = a? 
serve to determine g and n. These formulae give n = 2/o?. Since for 
large 7, o? ~ 2/7 for bandpass noise, n ~ 7 = 2(26)7’. That is, for large 
7, the bandpass noise acts like a x? variate with 2(26)T degrees of free- 
dom in agreement with an argument easily derived from the sampling 
theorem. 


III. GENERAL THEORY 


Let N(¢) be a Gaussian noise with mean zero and covariance 
p(t, ’) = EIN@N(¢)) 


where as usual / denotes expectation. In studying properties of N(é) in 
a finite time interval, say (—7'/2, T/2), it is convenient to make an ex- 
pansion in terms of an orthonormal set of functions, ¢,(¢), 2 = 0,1,2,.... 
We write 


[oo] Tl 
NO® = dine, ltlss 
where 
7/2 
n=] NOv@ at, Leto 
— 7/2 : 


and 
T/2 


i; vi (te; (2) dit = O15 2, J = 0, 2; pre 
— 7/2 


As is well known,’ it is particularly convenient in this description of 
the noise to choose as the orthonormal set, ¢;, the solutions of the 
homogeneous Fredholm equation with p(t, t’) as kernel. That is, the ¢’s 
are chosen so that 


T/2 


reid = [ot De) a, [tls 


nls 


’ all |e) yp a (4) 
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For, with this choice of the ¢’s, it is easily shown that the n; are inde- 
pendent Gaussian variates with mean zero and variance E(n7) = 3, 
t= 0,1, 2,.... We assume in all that follows that the )’s are so labelled 
that Ao = M1 = he = pe ee 

Consider now the average power, y, of a finite sample of the noise. It 
follows that 


cs 
I 


1 Tle ; [os ; 
< t ee ; 
FL ng NO at pm” 





3 (5) 
= Daz’, 
0 
where 
Bi ai 
and 
NG 
a; = vik (6) 


Equation (5) exhibits y as a linear combination of independent random 
variables. The x; are independent Gaussian variables all with mean zero 
and variance unity. The characteristic function, C(é), for y then follows 
readily. One has 


Eetttiz7” 


rs 


C() = Belt’ = Bela? = 


F] 


ll 
° 


(7) 


I 


Il (1 — 2a)”, 
7=0 


Here, as throughout this paper, the positive square root of a complex 
quantity is taken to have an angle between — (7/2) and +(a/2) radians 
(the cut line is along the negative real axis). 

From the characteristic function (7), the semi-invariants of y can be 
calculated. By definition’ of the semi-invariants, Kia 


loz C@ = DS Gy 
From (7) and the expansion | 


log A — x) = -oF, 
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it follows that 





= oOo © +. \i 
log C(é) = | 2! og (1 — 2iéa,) = ; pae> a 
7=0 k=1 j=0 
_ Ge)" 
= 2 Er * 
where 
=(b- 12D peat (8) 


From the semi-invariants, the moments of y can be found as in Refer- 
ence 3. 

The formula (8) for the semi-invariants can be put in a convenient 
form not involving the a; explicitly. From the well known expansion‘ 


p(t, ’) = Dot dses(Hex(t’) 
and the orthonormal properties of the ¢’s, one finds 


(ks iro on a 
ee OS Se. t) dt 9 
Kk id | ats p (i, ) ’ ( ) 


where the iterated kernel p(t, ¢’) is defined by 
po Gt) =), 


T/2 
o(t, t’) = [ ot a) (a, #) do, m= 2,8, 2 


The determination of the higher order iterated kernels generally becomes 
difficult in practice. 

The expression (9) is of the form conjectured by Rice’ on the basis of 
computing the first four semi-invariants of y. The formula (7) was given 
by Kae and Siegert” and (9) was noted by Arthur’ in a special case in 
connection with the analysis of a frequency discriminator. 

The probability density function for y is obtained as the Fourier 
transform of C(&), 

2 e—tky 
fly) = eee (10) 
ia I (1 — 2¢¢a,)"” : 


and the cumulative distribution function can be written as 


ry) =1— | fe) ae. (11) 
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Much of the remainder of this paper will be concerned with evaluating 
(10) and (11) for specific noises. 


IV. COMPUTATIONAL FORM FoR f(y) 


The evaluation of the integral (10) presents many difficulties even 
with modern computing machinery. From the physical origins of the 
problem under discussion, it is clear that for small values of 17’, f(y) 
must be a rather broad function (non-localized), whereas for large values 
of 7 it must approach a 6-function centered at the point y = p(0, 0) 
when the noise is assumed ergodic. The behavior of (10) therefore de- 
pends in detail on the manner in which the a; approach zero with in- 
creasing 7. 

One seemingly attractive approach to the problem is to truncate the 
sum in (5) at 7 = M and correspondingly obtain a product with 7 run- 
ning from 0 to M in the denominator of the integral in (10). Procedures 
are described in the literature®7 for computing the distribution of a 
finite quadratic form in Gaussian variables. Estimates of the error due to 
truncation can also be obtained rather readily. Unfortunately, the best 
such estimates obtained by the author showed that for small values of 
6 or r, M must be taken quite large (50 or 60) to obtain answers guaran- 
teed accurate to two decimal places. Furthermore, the convergence of 
the computational schemes described®: 7 turned out to be very slow. The 


E PLANE 





Fig. 12 — Cut lines and contour in complex ¢ plane. 
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following alternative approach which can also be applied to finite sums 
of the form (5) was used to obtain the curves presented here. 

The function (1 — 2¢éa,)'” in the denominator of (10) has branch 
points at —7b; , where 


=e 


1 oe 
aor Dy? f= 0,1, oases v2) 


The b’s are real positive quantities, for the \’s are eigenvalues of a real 
symmetric positive definite kernel and must be real positive numbers. 
Line segment cut-lines are inserted in the complex £-planeifrom —7b2; to 
—thej41,7 = 0, 1, 2,... as shown in Fig. 12. When y <0, the value of 
(10) is zero as can be seen by closing the contour in the upper half plane. 
When y = 0, the contour of integration in (10) is displaced from the 
axis of reals to the contour, C, shown in Fig. 12. This displacement of 
contour is easily justified if II(1 — 2ita;)"” is of exponential order less 
than unity, a condition which will be fulfilled in the examples to be 
treated. The change of variable ¢ = 2& rotates the contour of Fig. 12 by 
90° in the positive direction. If one now collapses the closed contour 
curves about the cut-lines and takes proper care of the convention al- 
ready set forth for the square root sign, there results, 


fy) = 20-17, 
where 


1 bex+1 gut dt 


i= = RSS ee 1 ee 
: W Yboy Vi Dw’ ae 


and where 
Dit) =T] (1 = (13) 
7=0 j 


D(®) is closely related to the Freeholm determinant (Reference 4, Chap- 
ter 11) of p. 
In the application to be treated below, 
D(t) = H(z) . (14) 


where 


= g(t) | (15) 


is a non-negative monotone increasing real function of ¢ for t 2 bo. De- 
note its inverse by t = h(z). Let 
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ze = g(bx) | 

Ck = 3 (Zon41 — Zo) 

dy, = F(Zon41 + 22x) 
fork = 0, 1, 2,... and let 

z= ¢, cos 7x + dy. 


Then straightforward substitution yields 
MON (2) dx dz 





I, -[ a Hay C1, 6) 
(2 — 2x) (Zenza — 2) 
fy) = Dito)". (17) 
Similarly, one obtains 
Fly) = 1 — dopo(-1)" (18) 


with 
e—vh(2) h (z) ax 
A) 


(2 — 22) (Zee — 2) 


(19) 


see 7 a on 


Equations (16) to (19) were used to compute the curves discussed in 
Section II. The denominators of the integrals in (16) and (19) have no 
zeros in the range of integration. By use of Gauss’s method of numerical 
integration,’ evaluation of the integral at x = 0 and x = 1 where the 
denominator is an indeterminate form was avoided. In the applications 
made, it can be shown that for sufficiently large k, Ii, and J; decrease 
monotonely. Since the series (17) and (18) are alternating, an estimate 
of the error made by terminating the series at a finite value of k can 
be obtained. In all cases computed, it was never necessary to take k 
larger than 18, to obtain 1 per cent accuracy in the final result. 





V. DETERMINATION OF EIGENVALUES AND H(z)* 


For stationary processes, the kernel of the integral equation (4) be- 
comes a difference kernel; that is, p(é, t’) = p(t — ¢t’) where p(x) is a 
positive definite function. The Fourier transform of p, namely 


wif) =f ” BF (a) de 


is non-negative and is the power density spectrum of the processes. 





* An alternative method of evaluating H(z) is described in Reference 12. 
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Analytic solutions to the integral equation (4) are known in this case 
only for a relatively small class of kernels. Fortunately, this class is one 
of considerable interest in communication applications. It is the class 
of p whose spectra w(f) are rational functions of f?; i.e., ratios of poly- 
nominals in f?. Such spectra are obtained by passing white noise through 
a finite passive physical electrical network with lumped constants. De- 
tails of the method of solution are given in References 9 and 10. It 
must be pointed out that, even in this case, solutions can be carried out 
practically only for polynomials of small degree. 


5.1 RC Noise 


If white Gaussian noise is applied to a series RC circuit, the voltage 
across the capacitor has a power density spectrum proportional to 


2a 
a? + Af? 
where a = 1/RC is the nominal cut-off frequency of the circuit. The co- 
variance function corresponding to (20) is 


p(t) = A, (21) 


Solutions to (4) with this kernel are given in detail in both References 
9 and 10. 


w(f) = (20) 


Let 
aT 
B= oe (22) 
Then 
1 
b, = 3B [6° + x’, k = 0,1,2,..., (23) 
where the z are non-negative roots of either of the equations 
ztanz = B (24) 
zcotz = —8. (25) 


If the z, are labelled so that 2. < 21 < 22 <..., then it is readily seen 
that 2, ~ k(2/2), so that b, ~ k’(x’/88). The convergence exponent (see 
Reference 11, p. 14) of the sequence b; is therefore 4. It follows then (Ref- 
erence 11, 2.6.5, p. 19) that D(é) as given by (18) is an entire function of 
order 3. 

Now the function (e°°/g)[8 cosz + z sin z] [cosz + A(sin z/z)], where 
z = ~/2p8t — B, is an entire function of t of order }. Its only zeros are 
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at the points ¢ = b, , k = 0, 1, 2,..., given by (23), (24) and (25). At 
i.= 0, it has the value unity. It is, therefore, equal to D(t) as can be seen 
from Hadamard’s Factorization Theorem (Reference 11, 2.7.1, p. 22). 

The quantities necessary to evaluate (16) and (19) are therefore all 
known for this case: 


B . 
H(z) = “UB cos 2 — z sin 2] cos rd 





Zz 
1 2 2 
h(z) = 53 l8 + z] 


and the 2 are given by the positive roots of (24) and (25). 
The first two semi-invariants of y are found to be 


nn = ky = 1, 


I 


K2 


1 = 
By — 1? =o = 7a [4p —1 +e") 


5.22 RLC Noise 


If white Gaussian noise is applied to a series RLC circuit, the voltage 
across the resistor has a power density spectrum proportional to 


w(f) = QO (26) 


2 
Oe i (2) (it — 02)" 
Wo 


where w = 2xf, Q = wol)/R and w = 1/LC. Introducing parameters u 


and v defined by 
wt = wilt — 2| 


UW = wW, Re u 2 0, Rev 2 0, 
one finds 
= 2(u + vw" 
MD = CEE EF A) 
and 


i 
u—v 





p(r) = [ag Ol ae ge], (27) 
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In the special case Q = 3, 
p(r) = (1 — wo] 7 [eo “0 (28) 


Solution of (4) with (27) or (28) as kernel is relatively straightforward 
by the methods of References 9 and 10, although quite laborious. Details 
can be found in Appendix 2. 

Suppose Q and w are positive real quantities. Then the eigenvalues 

= T'/2b, are given by 


bb = = (P +e], %&b=0,1,2,... (29) 
where the z, are non-negative roots of either 


2r cos z = (2 a +r 2) sin V2? + 5? (30) 
Ve + 8 


or 
: m2 LL a 
or cos 2 = (2 — rf) SZ go ty MVE+F gy) 
zZ V2 + 3 
Here 
r - oF S = wT’ (32) 


The eigenfunctions belonging to roots of (30) are of the form 
A, cos $(% + Vad + s)t + Bi cos 3% — Va? + s%)t 
while those belonging to roots of (31) are of the form 
Cy sin 3(a + Va? + s*)t + De sin 3% — Va? + s*Jt. 


It is interesting to note that when the )’s are ordered in the usual way, 
the corresponding eigenfunctions do not in general alternate between even 
and odd functions of ¢. 

The infinite product (13) with the b’s given by (29), (80) and (81) 
can be written in closed form by arguments similar to those used in 
Section 5.1. From (80) and (81), it is seen that asymptotically succes- 
sive z, are separated by 7/2, so that b, grows like k’ and one is again 
dealing with an entire function of order 3.* For the pertinent quantities 


* More generally, it can be shown that for rational spectra if w(f) ~ f-?” then 
An ~ nv ?, (Private communication to author by A. Beurling.) 
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of (16) and (19), one finds 
—2r 
een sin 2 _ 
H(z) (3 | - r’) : 2r cos Zz 


2 2 sin V 2 + “| 
+ (¢ +1) ye (34) 
[@ ~ r’) sm " —~ 2r cosz — (2 + 7°) ee See a =] ; 


h(z) = = = +r] (35) 





with the z;, given as roots of (30) and (31). 
The first two semi-invariants of y are 


w= Hy =1 


2 —2r 


2 1 —2r reé ° 42 <i arT 
U=o = gar -1 +e + 25-3 sinh vial, 
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APPENDIX 1 


Let N() = acos wot + 0b sin wot, where a and b are independent Gaus- 
sian variates with mean zero and variance unity. Then y as defined by 
(1) is obtained by direct integration as 


y = aa + Bb’ 


_ 3 (1 # sin a Ga (1 = sm ), ‘oath 


Since y is the sum of independent random x’ variables, the density for y 
can be obtained as the convolution 


fe el 2a) e! (y—z) /2B) 
Iu) = VJ Qrax VW 2rB(y — a 


The substitution « = (y/2)(1 + cos 6) in this integral leads to 


where 
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oe YIM (Ala) +018) 1 T 
2V/ ap w Jo 


—(y/4) [Q/o)+(1/8)] , 

é€ a | 1 

eS ae 
2/ a8 a pp 4 


Finally, if sin g = sin s/s, 


oul) (at B-}) cos @ de 


tY) 


fly) = secge “* Jo(iy tan o sec ¢). 


APPENDIX 2 


The power spectrum corresponding to the covariance (27) can be 
written as 


2(u + v)p” 
PH =v) 


where p = iw = 2mif. From Reference 9, then, solutions to (4) with 
the kernel (27) must satisfy the differential equation 


2 _ #\(2-s') of =0, ( 
(a - “) Ge ~#) 


_ 2(u + 2) 
r 


w= 


or 


where 


a ae 8 = ue = y , 
a3” _— ue" = ae 

We choose a and 8 so that Rea 2 0, Re 8 2 0. Ifa ¥ 8, then gisa 

linear combination of the elementary functions e*’, ¢ “’, e*, e& *. 

It is easy to verify that if ¢ is a solution to (4) with a kernel p(é, ¢’) = 
p(|é¢ — # |), then g(é) + y(—é) and g(t) — ¢(—#) are also solutions. 
We can, therefore, restrict attention to even and odd solutions of (4). 
On substituting 


g(t) = A cosh at + B cosh Bt 
~ into (4), one finds 
Xr or Br 


PT -@-Aae- A  @-ae-w 
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1 cosh © + a sinh a u cosh e + Bsinh a 
Se aR i ear ey Tea 
(ili) 
v cosh “ +a sinh v cosh pr +B Ree 
| ee canes; eae ee ey 
v—a y? — B 


The determinant of the system (iii) must vanish. A bit of algebra shows 
this to be equivalent to 


2r cosh x + (a2 + 7°) sinh & (2? — 1’) sinh V2? — 5° a 0, 
x / pete Ae 
where « = (a + 6)(7'/2). It is not difficult to show that for positive w» 
and Q, this equation has roots only if a and 8, and hence x, are pure 
imaginary. Writing x = 7z, (iv) become (80) and (ii) yields (29). 
The substitution of 


g(t) = C sinh at + D sinh Bi 


(iv) 


into (4) again yields (ii) and equations analogous to (iii) with sinh and 
cosh interchanged. A similar analysis then gives (81). 

If a = £, then from (i), ¢ must be of the form A cosh at + Bi 
sinh at or C' sinh at + Dé cosh at. Substitution of these forms into (4) 
yields equations which cannot be satisfied for positive wo and Q except 
by the trivial solution A = B= C= D= 0. 
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The Measurement of Power Spectra from 


the Point of View of Communications 


‘Engineering — Part I 


By R. B. BLACKMAN and J. W. TUKEY 


(Manuscript received August 28, 1957) 


The measurement of power spectra is a problem of steadily increasing im- 
portance which appears to some to be primarily a problem in statistical esti- 
mation. Others may see it as a problem of instrumentation, recording and 
analysis which vitally involves the ideas of transmission theory. Actually, 
ideas and techniques from both fields are needed. When they are combined, 
they provide a basis for developing the insight necessary (2) to plan both the 
acquisition of adequate data and sound procedures for tts reduction to mean- 
ingful estimates and (ii) to interpret these estimates correctly and usefully. 
This account attempts to provide and relate the necessary ideas and tech- 
niques in reasonable detail. Part IT of this article will appear in the March 
issue of THE JOURNAL. 
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Communications systems and data-processing systems are generally 
required to handle a large variety of signals in the presence of noise. 
The design of these systems depends to a large extent upon the statisti- 
cal properties of both the signals and the noise. In most cases, the noises 
may be represented, or approximated, as stationary Gaussian random 
processes with zero averages, so that all of their relevant statistical prop- 
erties will be contained by the autocovariance function or the power 
spectrum. In many cases, the signals may also be represented, or ap- 
proximated, as stationary Gaussian random processes with zero averages. 

Noises, signals, or other ensembles of functions (given continuously or 
at intervals) which are approximately stationary but not Gaussian are 
often also usefully studied in terms of autocovariance functions or power 
spectra. Although the average and the spectrum are no longer the only 
relevant statistical properties, they are usually the most useful ones. 
Thus, we shall do well to keep as much of our treatment generally appli- 
cable as possible. 

In almost every case, the autocovariance function or power spectrum 
of either the noise or the signal will be of interest and importance. 

To determine the autocovariance function or power spectrum of an 
(approximately) stationary random process, we are often reduced to 
the necessity of measurement and computation. Exact determination 
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would require a perfectly-measured, infinitely-long piece of a random 
function (or a collection of pieces of infinite total length), and would re- 
quire infinitely detailed computations. Both of these requirements are, 
of course, impractical. Approximate determination, on the other hand, 
raises the questions of how much data of a given accuracy will be re- 
quired, what computational approach should be used, and how much re- 
liance may be placed upon the results. Practically useful answers to 
these questions may be found by combining results from transmission 
theory and the theory of statistical estimation. These answers prove to 
be relatively simple. The only major difficulty in their practical applica- 
tion is the extensiveness of the data required for highly precise estimates. 
This requirement is an inherent, irrevocable characteristic of such ran- 
dom processes. 

In this account we shall treat only the measurement of spectra of in- 
dividual noises or signals. The measurement and utilization of the cross- 
spectra of pairs of series is also important, but is beyond our present 
scope. Questions of distribution and anticipated variability of cross- 
spectral estimates, and of certain estimates derived from them, have re- 
cently been cleared up by Goodman.’ 

It is natural to feel that the measurement of power spectra is simple, 
and that no problems deserving extended discussion arise. After all, are 
there not commercial ‘‘wave analyzers” of many sorts; have not Fourier 
- series served for many years to analyze the frequencies of many signals, 
(musical instruments, human voices, etc.)? Why should there be a serious 
problem? 

There are two reasons why elementary methods fail us rather fre- 
quently. On the one hand, the signal may not be available in indefinitely 
long time stretches. Hither the conditions of observation, experimental 
or otherwise, or the difficulties of careful recording may make it imprac- 
tical to have so much data that we can afford to analyze carelessly. (The 
examples of Sections 26 to 28, involving spectra of radar tracking, 
noise in very short-lived devices, and irregularities in the earth’s rota- 
tion, respectively; all illustrate this point). Even if observation and 
recording can be afforded, the cost of computation often forces careful 
analysis. 

On the other hand, the random nature of much noise, and some sig- 
nals, in which the relative amplitudes and phases of different frequencies 
are not stably related (in contrast to voices and musical notes), intro- 
duces much more difficulty with sampling fluctuations and provides 
much more significant appearing, thus much more misleading statistical 
artefacts than experience with simpler signals would lead investigators 
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to expect. (In postwar oceanography, for example, high mechanical in- 
genuity was expended in the construction of simple and effective wave 
analysers to produce detailed spectra of ocean waves. The results were 
quite misleading, because the frequency resolution obtained was too high 
for the limited length of records used, and almost the entire appearance 
of the resulting spectra was an illusion due to the particular fluctuations 
of the particular record. The use of broader filters has since led to mean- 
ingful results which could be related to physically satisfying theories.) 
All too often, the practical study of spectra requires care. 

Effective measurement of power spectra requires understanding of a 
number of considerations and action guided by all of them. Explaining 
each individual consideration is necessary, but it is equally necessary to 
explain how they fit together. The general structure of this description 
of spectral measurement is the following: an introduction to the concepts 
(Sections 1-3), brief accounts of individual considerations (Sections 
4-19), accounts of how these considerations are assembled in analysis 
(Sections 20-21), and planning for measurement (Sections 22-28, 
which include discussion of examples), and Sections in Part II giving 
the details supporting the earlier sections. 

We have attempted to provide, somewhere, most of the facts and atti- 
tudes that are needed in the practical analysis of (single) power spectra. 

Readers interested in either completing their present knowledge or in 
gaining a brief overview of the subject may wish to proceed next to Sec- 
tions 20ff, whence they can be referred to specific sections of interest. 
For some, reading of Sections 1-3 may be a helpful preliminary for Sec- 
tions 20ff. For others, who want to build more solidly as they go, reading 
straight through, perhaps with considerable cross-reference to Part II, 
may be best. 

A function of time X(t) generated by a random (or stochastic) process 
is one of an ensemble of random functions which might be generated by 
the process. The value of the function at any particular point in time is 
thus a random variable with a probability distribution induced by the 
ensemble. Furthermore, the values of the function at any particular set 
of points, say ¢ = 4, &, ---, ¢, , have an n-dimensional joint probability 
distribution also induced by the ensemble. Such probability distributions 
have an important bearing on the design of any communication system 
or data-processing system which must handle an output from such a 
random process, be this output “‘signal’’ or “noise’’. 

We shall often, but not always, assume that the random process is 
Gaussian. This means that, for every n, i, 2, ---, tx, the joint proba- 
bility distribution of 
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X (hi), X(t), ean) X (tn); 


is an n-dimensional Gaussian or normal distribution. Each such distribu- 
tion is completely determined by the ensemble averages 


X(i;) = ave {X(t)}, 
and by the covariances 
Ci; = cov {X(t,), X(¢;)} 
= ave {[X(4) — X(4)] [X,) — X@)]}. 


As a matter of convenience in development we will assume that the 
averages X(é;) are zero. The covariances then reduce to 


Ci; = ave {X (é;)-X(é;)}. 


Throughout, we will assume that the random process is statéonary (that 
is, temporally homogeneous) in the sense that it is unaffected by trans- 
lations of the origin for time. The covariances C;; now depend only on 
the time separation t; — ¢; so that 


Ci; = Cit; = t;). 


Thus, the noise is completely specified by a single function of a single 
variable. In particular, C(0) is the varzance (for zero average, the average 
square) of X(?). 

If the process were stationary, with zero averages, but were not 
Gaussian, then knowledge of the covariance as a function of lag, although 
providing a very large amount of useful information, would not com- 
pletely specify the process. The results of this paper fall into two cate- 
gories: (i) those relating to average values of spectral estimates, and 
(ii) those relating to variability of spectral estimates. The average-value 
results apply generally under the assumptions of stationarity (and zero 
averages), and do not depend upon the Gaussian assumption. The varia- 
bility results are exact under the Gaussian assumption, and are usually 
rather good approximations otherwise. Thus, our results have practical 
value for noises and signals which are not closely Gaussian. . 

Results about variability are naturally used: (i) for planning the ap- 
proximate extent of measurement effort, (ii) for indicating the presence 
of changes, during a series of measurements, in the quantities estimated, 
and (iii) as a means of judging the precision of an over-all estimate. The 
results given here are mainly for the first planning use. The additional 
uncertainties in actual variability due to either non-normality of distri- 
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bution, or to changing of conditions between runs, or to both, are often 
all too real, but are rarely large enough to affect planning seriously. The 
same is true of mild nonstationarity. The Gaussian, stationary results 
can also be applied to the second use, the detection of changes in the true 
spectrum, but considerable caution is in order. The precision of final 
over-all values is ordinarily far more wisely judged from the observed 
consistency of repeated measurements (as by analysis of variance of 
logarithms of various spectral density estimates at the same nominal fre- 
quency) than from any theoretical variability based on a Gaussian as- 
sumption. 

Communications engineers are more accustomed to work with a single 
time function of infinite extent than with an ensemble of finite pieces (of 
such functions). It is perhaps fortunate, therefore, that averages across 
an ensemble are equivalent (ergodicity) to averages over time along a 
single function of infinite extent, whenever a process is stationary, 
Gaussian, has zero averages, and has a continuous power spectrum (no 
“‘lines’’). (If the process were not stationary the single function approach 
could not be used in this way.) 

Since we seek to make this account as intuitive as possible for com- 
munications engineers, we shall define transforms, and make many other 
computations in terms of averages along a single function (as limits of 
integrals over centered intervals). In dealing with more specifically sta- 
tistical issues, however, we shall write “ave” for average value, “var” 
for variance and “cov” for covariance, and shall do nothing to hinder 
the interpretation of these operators as acting across the ensemble. 
(Those who wish can also think of them in single function terms.) 

The covariance at lag 7, in single function terms, is given by 

T/2 


ae 3 
CG) = lim 5 [ _ XO-X6 + 2)-dt 


In ensemble terms, we would write merely 
C(r) = ave {X(t)-X(¢ + 71)}. 


The function C(7) is frequently called the autocorrelation function, 
although historical usage in both statistics and the theory of turbulence 
(Taylor’) shows that this name should be applied to the (normalized) 
ratio C(r)/C(0). We shall call C(r) the autocovariance function. This 
name is appropriate to our formal definition of C(r) because we have 
assumed that the averages of our process are all zero. Whenever we give 
up the assumption of zero averages, as we must almost always do when 
dealing with actual data, we shall use 
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ave {X(¢)-X(t + 7)} = average lagged product, 
ave {[X(t) — X]-[X(é + 7) — X]} = autocovariance function, 


where X is the common value of ave {X(t)} and ave {X(é + 7)}, thus 
preserving accurate usage. 

Because of the direct relationship of the joint probability distribution 
to the autocovariance function, much of the statistical attention given 
to Gaussian stationary time series (time-sampled random functions) has 
been expressed in terms of serial-correlation coefficients (corresponding 
to lag-sampled autocovariance functions). 

A stationary Gaussian random process may be regarded (e.g. Rice’) as 
the result of passing white Gaussian noise through a fixed linear network 
with a prescribed transmission function. White Gaussian noise, in turn, 
may be regarded as the superposition of the outputs of a set of simple 
harmonic oscillators (continuously infinite in number) with 

(a) a continuous distribution in frequency,* 

(b) uniform amplitude over the significant frequency range of the 
transmission system, and 

(c) independent and random phases. 

This point of view is particularly suited to the techniques employed by 
communications engineers. It is therefore not surprising that communi- 
cations engineers have dealt with stationary Gaussian random processes 
almost entirely in terms of power spectra. 

Because the autocovariance function and the power spectrum are 
Fourier transforms of each other, it would at first appear to be purely 
a matter of convenience which one is used in any particular situation. 
Indeed, optimum filter characteristics for the protection of signal against 
noise in communications systems and in many types of computing de- 
vices have, on occasion, been determined by the use of the autocovari- 
ance function. In practice, however, the filter designer almost invariably 
turns to the power spectrum as the final criterion of adequate design 
and performance. 

In practice also, where the autocovariance function or the power spec- 
trum must be determined by measurement and computation, and then 
interpreted, the choice is now heavily weighted in favor of interpretation 
of the power spectrum. Although a great deal of theoretical work has 
been done on the probability distribution of the serial-correlation co- 
efficients for Gaussian stationary time series of finite length, with a view 
to the estimation of the confidence which may be placed upon practical 

* The term “frequency” is used throughout this paper in the communications 


engineer’s sense, viz., cycles per second of a sinusoidal wave. (Exceptional uses 
in the statistician’s sense are explicitly noted.) 


MEASUREMENT OF POWER SPECTRA 193 


results, the criteria which have been developed along this line are so 
complicated that it is extremely difficult to apply them in practice, 
where the joint distribution must be considered. On the other hand, the 
situation with respect to the power spectrum is now very satisfactory 
for practical purposes. This stems from results obtained by Tukey,’ 
and in part independently by Bartlett,’ about nine years ago, when 
studies were made of the effects of sampling, of finite length of series, 
and of choice of computational procedure on the behavior of the esti- 
mated power spectrum. Since that time, applications to such diverse 
fields as ocean waves (Marks and Pierson®), aerodynamics (Press and 
Houbolt’), meteorology (Panofsky*), and seismology (Wadsworth, 
Robinson, Bryan, and Hurley’), have shown the practical applicability 
of these results to a wide variety of physical time series. 

Shortly after these studies first reached the stage of practical useful- 
ness, the theoretical analysis was reformulated by Blackman, who ex- 
pressed it from the point of view of transmission theory, for presentation 
to members of the technical staff of Bell Telephone Laboratories 
(Out-of-Hours Courses 1950-1951, Communications Development 
Training Program 1950~1952). 

More recent contributions (1950-1957) to the theory of power spec- 
trum estimation have been reviewed by Bartlett and Medhi,” by Bart- 
lett,” and by Grenander and Rosenblatt.” 


2. AUTOCOVARIANCE FUNCTIONS AND POWER SPECTRA 


First, let us consider the ideal case. The autocovariance function which 
was defined in the preceding section by 


T/2 
Clr) = lim, [XOXO + 2nd 
too 1’ J-rle 
may be reduced to the form 
CG) = [P(e af 
where 
1 T/2 ; 2 
P(f) = lim r| [ Xjse oT ai 
too 1'| J-rl2 
(cp. Section B.2). The function of frequency P(f) describes the power 
spectrum of the stationary random process considered. More precisely 


P(f) df represents the contribution to the variance from frequencies be- 
tween f and (f + df). If we think of X(t) as a voltage across (or current 
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through) a pure resistance of one ohm, the long-time average power dis- 
sipated in the resistance will be strictly proportional to the variance of 
X(t). This important special case is the excuse for the adjective “‘power’’. 
The pure statistician might prefer to refer to the covariance spectrum or 
to the second moment spectrum rather than to the power spectrum. 
For precision, we shall often refer to P(f) as the spectral density or 
power spectral density. When no confusion is likely, we may call P(f) 
merely the power spectrum. 

The relation exhibiting the autocovariance function as the Fourier 
transform of the power spectrum may be inverted to express the power 
spectrum as the Fourier transform of the autocovariance function. Thus, 
we have 


P):= [ C(r) 6 P dr. 


The autocovariance function C(r) and the power spectrum P(f) are, 
formally at least, even functions of their respective arguments. Hence, 
the relation between them may be expressed more simply as two-sided 
cosine transforms, viz. 


cr) = f : P(f) cos 2afr-df, 
and 
Pp) = [ C(r)-cos 2nfr-dr; 
or perhaps even more simply, as one-sided cosine transforms, viz. 


C(r) = 2 [ P(f)-cos 2xfr-df 
and 
P(t) =2 if C(r)-cos 2xfr-dr. 


Results are usually more conveniently developed in terms of the two- 
sided forms than in terms of the one-sided forms. In Sections A.3 and 
B.4 for example, the use of the two-sided forms with exponential kernels 
will be found to simplify considerably the expression of the operation of 
convolution between functions of lag or of frequency. In Section B.6, the 
use of the two-sided forms with exponential kernels avoids some compli- 
cated manipulations of trigonometric identities in the early stages of the 
development. 
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It should be particularly noted that 
ave {X()-X(t + 7)} = , 2P(f)-cos Infr-df 
and that (setting 7 = 0) 
var (X(} = f° 2PCA) af. 


Thus, it is evident that our definition of the power spectrum differs from 
the usual one which associates the power spectrum only with positive 
frequencies. References to the power spectrum in practice are usually 
in terms of a density 2P(f) over 0 S f < @ only. 


3. THE PRACTICAL SITUATION 


In practice we can obtain only a limited number of pieces of X(¢) 
of finite length. Each piece may be regarded as a sample drawn from a 
population or universe of pieces of X(t) of the same length. The reduc- 
tion of the data will therefore yield no more than estimates of the auto- 
covariance function and of the power spectrum — estimates which are 
subject to sampling variations and to biases in the usual statistical sense. 
This situation is further complicated in those cases in which we can 
measure, or desire to use, only values of X(¢) at uniformly spaced values 
of ¢ within each piece of X(t); in other words, those cases in which we 
are dealing with classical time series (discrete time) rather than with 
time functions (continuous time). 

The theoretical study of sampling variability and bias is much simpler 
in the case of the estimates of the power spectrum than in the case of 
the estimates of the autocovariance function (or of serial-correlation co- 
efficients). This reflects the fact that, as we consider longer and longer 
~ records, and two narrow frequency bands with an arbitrarily small but 
fixed separation, we may find estimates of the power in these frequency 
bands which both (i) become arbitrarily precise, and (ii) become arbi- 
trarily nearly (statistically) independent. The existence of such esti- 
mates is another particular consequence of the Gaussian character, as 
expressible in terms of “random and independent phases”, of the ran- 
dom process from which we have one or more samples. 

Use of the power spectrum has an additional advantage over use of 
the autocovariance function. In almost all practical situations, the data 
analyzed does not represent the actual output of the random process. 
In such cases the data will have been modified, appreciably if not radi- 
cally, by the transmission characteristics of the devices employed to se- 
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cure the data. This modification of the data may in fact be intentional, 
as we shall see when we come to the discussion of “prewhitening”’ in 
Section 15. In any case, the estimates will have to be corrected for the 
effects of this modification of the data. For estimates of the power spec- 
trum, the correction procedure is a simple division of a frequency func- 
tion by another frequency function. For estimates of the autocovariance 
function, however, the correction procedure will require a Fourier trans- 
formation, division of the resulting frequency function by another 
frequency function, and an inverse J’ourier transformation. This whole 
sequence of operations on the autocovariance function is the only prac- 
- tical procedure for the inversion of the convolution (see Appendix A.3) 
which is the effect to be corrected for. (Details are given at the end of 
Section B.3.) 

As we shall see, the measurements and computational operations may 
involve the use of either analog or digital computation and handling of 
either continuous “signals” or discrete data. (Whatever be its rela- 
tion to some communication or data-handling system, we shall call con- 
tinuous-time signals or noise which we are analyzing “signals”, while dis- 
crete-time signals or noise, or discrete-time samples thereof, will be 
called data.) In actual practice, and for well-defined reasons of in- 
strumentation and computation engineering, only a few of the many 
possible combinations are used. 

Spectrum analysis by analog computation is almost always applied 
to continuous ‘‘signals”, and makes use of filtering rather than going 
through autocovariance or mean lagged products. Digital computation 
must be carried out on discrete data, perhaps time-sampled from a 
continuous “‘signal”’, and preferably uses an indirect route via mean lagged 
products rather than trying to isolate individual frequency bands di- 
rectly. In either case, each data value must enter several computations, 
and it is rarely economic to carry these computations out directly in real 
time, especially since there will not usually be enough such analysis on a 
regular basis to saturate the working capacity of the analog or digital 
computer used. Consequently, recording, either of ‘‘signals”’ or of data or 
of both, is almost inevitable. 

Thus, five stages will be important in nearly every case: 

(1) sensing (pick-up, conversion, etc.) 

(2) transmission (to recorder or, possibly, to computer) 

(3) recording (including play-back, and, perhaps, time-sampling) 

(4) computation (formulas, computing circuit performance, etc.) 

(5) interpretation. | 
In every one of these stages, quality of performance (noise level, dis- 
tortion, etc.) will be of importance. 
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The present account concentrates on the computational and inter- 
pretational stages, but indicates, from time to time, those considerations 
in the other stages which are peculiar to power spectrum analysis. 

We have been unable to find a wholly satisfactory arrangement for 
the material we wish to present. In order to facilitate a relatively easy 
once-over, these introductory sections now continue into a condensed 
account, from which proofs, some reasons, and many helpful remarks 
have been postponed to the Appendix and sections in Part IT. Readers in- 
terested in a survey may find it adequate to read only the condensed 
account. Others may find it best to skim this condensed account first, to 
read Appendix A next, and then to study similarly numbered sections of 
Part II and the condensed account. 

The continuous record of finite length will be treated first (Sections 
4-11); the modifications required for the discrete equally spaced record 
are covered next (Sections 12-21), and the opening account concludes 
with a discussion of the planning and analysis of measurement programs 
(Sections 22-28). 

Appendix A (Sections A.1 to A.6) treats fundamental Fourier tech- 
niques, and the transform-pairs most closely associated with diffraction, 
in both the continuous and equi-spaced cases. 

Each section of Part II relates to the similarly numbered section of 
the main body, and contains details of derivations, further reasons, and 
additional helpful remarks. 

Definitions of the technical terms, arranged alphabetically for ref- 
erence, are included at the end of Part I. Similar definitions of the nota- 
tion will be given at the end of Part IT. 


Continuous Recorps oF FINITE LENGTH 
4, FUNDAMENTALS 


Given a continuous record of finite length, it is clear that we cannot 
estimate the autocovariance function C(r) for arbitrarily long lags. 
Surely, no estimate can be made for lags longer than the record. Fur- 
thermore, as we will find in due course, it is usually not desirable to use 
lags longer than a moderate fraction (perhaps 5 or 10 per cent) of the 
length of the record. Thus, in place of 


T/2 


= ] 1 ° . 
C() = lim 7 i X()-X(t + 1) dt 


for all values of 7, we will have at our disposal 


1 (TPn-'r|)/2 x) r 
Coola) ToS | leds e (: age ( i: 5a Sena 
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only for|7| S$ 7m < 7, where 7’, is the length of the record, and 7'» 
is the maximum lag which we desire to use. We will call Coo(r) the ap- 
parent autocovariance function, since (on account of ergodicity) its 
average value is C(r) for|7| S Tn. 

The class of estimates for the power spectrum with which we are chiefly 
concerned will be derived from a modified apparent autocovariance function 
by Fourier transformation. While the modified apparent autocovariance 
functions, which are obtained by multiplying the apparent autocovari- 
ance function by suitable even functions of 7, are often far from being 
respectable estimates of the true autocovariance function, their trans- 
forms are very respectable estimates of smoothed values of the true spec- 
tral density. 

Let D,(r) be a prescribed even function of 7, subject to the restrictions 
D0) = 1, and D,(r) = 0 for|7| > T,, (where 7 = 0, 1, 2, 3, 4, de- 
pending upon the shape of D,(7) for | 7| < Tn), and let the correspond- 
ing modified apparent autocovariance function be defined by 


Ci(7) = Dj(7)-Coo(7). 


We may regard D,(7r) as a window of variable transmission which modi- 
fies the values of Coo(r) differently for different lags. It is therefore 
natural to call D;(r) a lag window. 

For any lag window which meets the conditions stated above, C’,;(r) 
is calculable from the data. Further, it is clear that C,(r) = 0 
for |r| > Tm although Coo(7) was not defined there. Because C,(r) 
is defined for all values of 7, it has a perfectly definite Fourier transform 
Pf), which should satisfy the symbolic relation, 


Pf) = Qf) * Polf) 


where Q,(f) is the Fourier transform of D;(r), the asterisk indicates con- 
volution (see Appendix A.3 for discussion), and Poo(f) is the Fourier 
transform of Coo(7). However, Po(f) is not determinate because Co(7) 
is not specified for |7| > 7m (and its definition cannot be directly ex- 
tended beyond | 7| = 7’). Nevertheless, since 


ave {C(r)} = D(r)-C(r) 
where C(7) is the true autocovariance function, it follows that 
ave {P.(f)} = Q.(f) * P(f) 


where P(f) is the true power spectrum, that is, the Fourier transform 
of C(r). The average may be thought of as either across the ensemble, or 
along time. (The latter type of averaging would correspond to replacing 


MEASUREMENT OF POWER SPECTRA : 199 


X(t) by X(é — A), thus changing the stretch of X(t) which is observed, 
and then averaging over \.) The corresponding explicit relation, viz. 


ave (P(A) = [ ; Qi(f: — f) PCS) -af 


exhibits the average value of P,(fi) as a smoothing (average-over-fre- 
quency) of the true power spectrum density P(f) over frequencies ‘‘near’’ 
fi with weights proportional to Q,(f; — f). In a manner of speaking P,(f1) 
is the collected impression of the true power spectrum P(f) obtained 
through a window of variable transmission Q;(fi — f). It is therefore 
natural to call Q,(f) the spectral window corresponding to the lag window 
D,(r). 

The form just given for ave {P,(f,)} is natural for our two-sided defi- 
nition of power spectra, but, in order to view the result from the stand- 
point of transmission theory for real-valued signals, it is convenient to 
express the result in a form appropriate to a one-sided definition of power 
spectra. Taking advantage of the fact that Q.(f) and P(f) are even func- 
tions, we may write 


ave {2P,(f1)} = [ Hi(f; fr) -2P(f) -af 


where 


Af; fd) = Of + fd + O¢f — fr) 


and where we recall that 2P(f) df is the amount of power between f 
and (f + df) in the one-sided true power spectrum. Similarly, 2P;(f) df 
is the amount of power between f and (f + df) in the one-sided estimated 
power spectrum. The function H,(f; fi) has one of the necessary proper- 
ties of a physically realizable power transfer function inasmuch as it is 
an even function of f as well as of fi. In general, however, it does not 
have the property of being non-negative at all frequencies f. Neverthe- 
less, it is a convenient function to use in the analysis of the variability 
of the estimated power spectrum. It will be convenient to regard the 
average value of the smoothed power density estimate ave {2P,(fi)} 
as the result of passing the true power spectrum, through a “network” 
with power transfer function H,(f; fi). 

We see that our procedures will lead us to estimates whose average 
values are a smoothing (average-over-frequency) of the true power spec- 
tral density P(f) over frequencies “near” f,, and not to estimates of 
P(fi) itself. The problem of choosing the shape of the lag window D,(r) 
so that its Fourier transform Q,(f) will be concentrated near f = 0 is 
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almost identical to the problem of choosing an intensity distribution 
along an antenna so that most of the radiation from the antenna will 
fall in a narrow beam. F’rom this analogy we will use such terms as main 
lobe and side lobes for the principal maximum and subsidiary extrema 
of Q:(f). (Indeed, any attempt to confine the power transfer function to 
too narrow a frequency band — too narrow in comparison with the re- 
ciprocal of the longest lag used — would be analogous to an attempt to 
construct a practical hyperdirective antenna.) 

It is not surprising that we are led to estimate a smoothed power spec- 
trum. With only a finite length of X(é) available, we should not expect 
to be able to identify frequencies exactly, and are, indeed, unable to do 
so. (The presence of neighboring frequencies with random phases will 
have effects similar to those of noise in preventing such identification.) 


5. TWO PARTICULAR WINDOW PAIRS 


In order to specify a particular family of estimates within the class 
of estimates defined in the preceding section, we have only to specify 
Dr) or Qi(f). We would like to concentrate the main lobe of Q,(f) 
near f = 0, keeping the side lobes as low as feasible. In order to concen- 
trate the main lobe we have to make D,(r) flat and rather blocky. In 
order to reduce the side lobes, however, we have to make D;(r) smooth 
and gently changing. Since D,(r) must vanish for |7| > 7, we must 
compromise. So far, cut-and-try inquiry has been more powerful in find- 
ing good compromises than has any particular theory. 

A simple and convenient compromise is represented by the lag win- 
dow (whose use is called “hanning’’) 


Dr) = (1 + cos for Leh <P 


VT 
m? 


fe 
=0 for ee |i Ts 


(Window pairs 0 and 1 are discussed in Section B.5.) An alternative 
compromise is represented by the lag window (whose use is called “ham- 
ming”’) 


D3(r) for a) <Te 


I 


0.54 + 0.46 cos 


oS 
Tm 
= 0 for [| Dns 


These lag. windows and the corresponding spectral windows are illus- 
trated in Fig. 1. Notice that the main lobes are four times as wide as the 
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Fig. 1 — Lag windows Dy» and D3. Spectral windows Q:2 and Q3. 


side lobes (excepting the split side lobes nearest the main lobes), and 
that the (normal) side lobe width is 1/(27 nm). 

The general nature of the spectral windows in these two pairs is the 
same: a main lobe, side lobes at most 1 per cent or 2 per cent of the 
height of the main lobe. There are differences, which are sometimes rele- 
vant, but these may not be obvious. The two most important of these 
differences are: 

(a) The highest side lobe for the “hamming” (spectral) window is about 
3 the height of the highest side lobe for the “hanning’’ window, 

(b) The heights of the side lobes for the “hanning” window fall off 
more rapidly than do those for the “hamming’’ window. 

One difference favors one pair, and one the other. 
These and several other window pairs are discussed in Section B.5. 


6. COVARIABILITY OF ESTIMATES — BASIC RESULT 


It is shown in Section B.6 that, strictly only under Gaussian cir- 
cumstances, the covariance of any two power density estimates of the 
sort we have been considering is given to a good degree of approxima- 
tion by 
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cov (2PAf), PH) & [HAG A) HAGA) 2L)-a 


where the power-variance spectrum I'(f) depends only on the true power 
spectrum P(f) and the effective record length T;, , as described below. 
Thus, we may regard the covariance of the two power density estimates 
as the result of passing the power variance spectrum I(f) through two 
networks in tandem, one with power transfer function H,,(f; f:), the other 
with power transfer function H,(f; fe). In other words, we may regard 
the covariance (of the estimates of the power spectrum) as the power 
remaining from the power-variance spectrum I(f) after passing through 
the two windows H,,(f; fi) and H,(f; fe) associated with the estimates 
themselves. If the windows do not overlap, the estimates do not covary 
(at least not in terms of second moments). 
In particular, of course, 


var {2P.(f;)} = cov {2P,(fi), 2P.(fr)} 
~ | ” HAS; fy 20(f) df 


to which we can give a similar interpretation. 
These results would become exact if we were to replace Coo(r) by 


5 1 (Tr-T m)/2 s i 

Co(r) = hor, [ ene x(¢ s).x (: + 5) at 
where|7| S 7, < 7, .In Coo(7) weaveraged X(é — (7/2))-X(é + (7/2)) 
over an interval of ¢ of length T, — |r|, varying with r. In Coo(r) we 
would be averaging X(t — (7/2))-X(é + (7/2)) over an interval of ¢ of 
length 7, — T., independent of 7. We could actually do this because 
{¢ + (7/2) | S T,/2 for |r| S Tm. However, for values of | 7 | less 
than 7'm, Coo(r) would not make use of some values of X(é — (7/2))- 
X(é + (7/2)) which are used in Coo(r). Thus, Coo(r) would be wasteful. 
It seems best, therefore, to use Coo(r) for computation, but to approxi- 
mate its variability by the variability corresponding to a Coo(7) which 
could not be calculated from the actual values. This “approximate”’ 
hypothetical Coo(r) involves a fixed range of integration 7, part way 
between 7’, — 7, and T,,. The situation is illustrated in Fig. 2, where 
the ranges of integration are shown for the actually “feasible” Coo(r), 
for the Co(7) which “wastes not”, and for the Coo(7) which we use to 
“approximate” Co(r). The shaded areas delineate the products which 
are actually available. 
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The best choice of an intermediate value depends somewhat upon the 
D;(r) and D,(r) involved, and is discussed in Section B.6. In practically 
useful cases we may take 


T, = T, —4Tn. 


The power-variance spectrum is given approximately and closely by 





= / 7 j aus ° 7 
rf) =4f PU +S)-PU-S (Be ) df’ (w! = nf’). 
If we have p pieces of total length T,, , and if, in computing our estimate 
of C(r) for each 7, wecombineall availablelagged products 
X(t — (r/2)) XE + (7/2) 


without regard to which piece they came from, then we may use this for- 
mula for I'(f) with 





_Th- om 






i In 
2 


Be sae 2 — Range of integration over ¢ as a function of 7 in Coo(z), Coo(r), and 
0a(7) 
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7. COVARIABILITY OF ESTIMATES ——- APPROXIMATE FORMS 


In assessing the covariability of estimates of the smoothed power spec- 
trum, the relative magnitudes of three distances along the frequency 
axis are important: 

(a) the distance 1/7", , the reciprocal of the effective length of seeerd: 

(b) the least distance over which P(f) changes by an important amount 
for f near f, , and 

(c) the least distance over which H,(f; f:) changes by an important 
amount for f near f; (this is of the order of 1/7’, and is usually much 
larger than 1/7). 

If P(f) changes slowly enough to make (b) larger than (a), we may 
use the approximation 


rf) = ar (POE 
whence, approximately, 
cov (Pid, PAD) & gr | PalS)-Palf)-af 
where 
Pa(f) = Ah APY) 
Pf) = Hilf; f)P(f). 


In the same terms we have 


ave {Pi(fry} = [ Palf) “df 
and 


ave {P;(f2)} = [ Peta)-ar. 


The relation of covariances to averages thus established may be rea- 
sonably interpreted as meaning that any cancellations occurring in the 
average values also occur in the covariances and variances. To the ac- 
curacy of this approximation, then, we appear to be using the data rather 
efficiently. 

If, on the other hand, the true spectrum, P(f), consists of a single 
sharp peak at f = fo, we may use the approximation, derived in Sec- 
tion B.7, namely 


cov (PAR), PAD) &[ [ Pats)-ar]| [ Pats) at| 


ave {P.(fi)}-ave {P;(fo)}, 


MEASUREMENT OF POWER SPECTRA 205 


a result which is not influenced by 7%, (so long as 7’, does not become 
large enough for 1/7", to become comparable with the width of the peak). 


8. VARIABILITY — EQUIVALENT WIDTHS 
° . . / . 
If P(f) changes slowly in comparison with 1/7, , then, since 


var {Pifi)} = cov {PaAfi), Pilf}, 


we may write down the dimensionless variability of P;(f,) itself as 


var {P.(fi)} 1 


lave {PACAP TLW.’ 


[ [° eatsy-ar] 
[ eacny-ap 


is naturally called the equivalent width of Pa(f) = Hilf; fi)-P(f). 
The longer the record, and the wider the equivalent width, the more 
stable the estimate. (Increasing the width also of course makes the esti- 
mate refer to an average power density over a wider frequency interval.) 
If, on the other hand, P(f) consists of a sharp peak, then, by the con- 
cluding remarks of the preceding section 


var {P.(fi)} 
fave {Pi fi) }? 


The equivalent widths of some simple cases are as follows: 
1. If Pa(f) is a rectangle of width W which does not cross f = 0, 
then W, = W. 
- 2. If Pa(f) is a triangle of base W which does not cross f = 0, vertex 
anywhere over the base, then W, = 0.75 W. 
3. If Pa(f) is proportional to 





where 


é@ 


= 1. 


. Ww 1 - @ = Ww 
sin sin 2 
+ ——___—- 
aw Wy W— 


W W 


i.e. has the shape of Ao(f; fi), where W = Wirain = 2Weiae (these being 
the widths of main and side lobes, respectively), and if f, = 1/7 then 
W. = 0. W = 0.5 Winsin = Woaiae- 

4. If P(f) has the shape of H2(f; fi), i.e. is proportional to a hanning 
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(0.25, 0.5, 0.25) window, and if f; = 1/7, then W. = 0.67 Winain = 2.67 
W siae . 

5. If Pa(f) has the shape of H;3(f; fi), i.e. is proportional to a hamming 
(0.23, 0.54, 0.23) window, and if fi = 1/7, then W, = 0.63 Wmain = 
2.52 Wide - 

These cases are illustrated in Fig. 3, a single sketch sufficing for the | 
last two. Note that W. is close to 2Wmain for practical windows, if 
Fp AS Tes 

For our standard window pairs, hanning or hamming, the width of 
the normal side lobes is 1/(27',) and, consequently, W. ~ 1.30/T,, , 
if f, 2 1/Tn. 

These last three equivalent widths decrease somewhat as f; becomes 
small, and the values given should be halved for fi = 0. 

If P(f) varies linearly across H,(f; fi), then a calculation discussed 
in Section B.8 shows that W, will tend to fall in the range from 1.15/7T'm 
to 1.23/T, . (A rather peaked case gives 0.94/7'» .) When we allow for 
the fact that we are likely to be concerned with processes which are not 
quite Gaussian, whose variances of estimate are consequently likely 
to be somewhat larger than for the Gaussian case, a change correspond- 
ing to the use of a decreased equivalent width in the formula, the choice 


Sf cuk 
We D - 


™m 


which introduces a small factor of safety (not more than.1.3) seems de- 


| | _ZA 


eel Paar 
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al we be ~-We-— 4 | 


Fig. 8 — Equivalent widths of some spectral windows. 
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sirable for planning purposes. Consequently, we shall plan according to 


var {Pi(fi} Pe Pn 
lave (P(f)}P Th" 


If we plan to hold the RMS deviation of each of our estimates below 
one-third of its average value, we must, accordingly, keep 7/7, below 
3. Chus, as noted above, we shall ordinarily keep 7, to a small fraction 
of T,, . 

In making more detailed studies of the variability of spectral esti- 
mates, further approximation will be convenient. It is important to 
note several reasons why we need not be too precise in making such ap- 
proximations. First, as noted earlier, the variability results depend on 
the noise being exactly Gaussian. Real noises (and especially real signals) 
need not be exactly Gaussian. Thus, even exact results in Gaussian 
theory would be approximations in practice. Second, the chief purposes 
of studying variability are first to choose, once for all, effective methods 
of analysis, and then, in each situation, to determine about how much 
data will be required for the desired or given accuracy. Again, approxi- 
mate results will be adequate. Third, it would not be safe to use the ad- 
vance estimates of variability as firm, guaranteed, measures of the sta- 
bility of the actual computed results in a practical situation, since other 
sources of variability may well contribute to the deviation of a particu- 
lar spectral density estimate from its long run value. (Non-constancy 
of total power level, even with distribution-over-frequency remaining 
constant, and failures of stationarity are two simple examples.) We 
must rely on observed changes from trial to trial as basically the safest 
measure of the lack of stability of our spectral density estimates. 

Thus, the purposes of variability theory are well served if its results 
are approximate — deviations of actual variability from theoretical 
variability of +5 per cent, +10 per cent or even +20 per cent will be 
quite satisfactory. Judged by this standard, the variability theory based 
on (i) the Gaussian assumption and (ii) treating the distribution of the 
spectral density estimates as if they followed so-called ‘chi-square’ 
distributions, as we shall do in the next section, will usually be very 
satisfactory. 


9. CHI-SQUARE — EQUIVALENT DEGREES OF FREEDOM 


If y1, yo, *°* 5 Ye are independently distributed according to a stand- 
ard normal distribution, that is, according to a Gaussian distribution 
with average zero and unit variance (and, consequently, unit standard 
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deviation), then 
Xe a y+ ys + ase Yu, 


which is obviously positive, follows, by definition, a chi-square distri- 
bution with k degrees of freedom. The coefficient of variation of x;’, 
the ratio of RMS deviation to average value, is (2/k)'”, so that, as k 
increases, x. becomes relatively less variable. This statement also ap- 
plies to any multiple of x”. 

A convenient description of the stability of any positive or nearly- 
positive estimate is its equivalent number of degrees of freedom, the num- 
ber of degrees of freedom of that x,’ some multiple of which it resembles 
(in average and variance unless otherwise specified). We can find such a 
k from 


2 
_ 2(average)” _ 2 
variance (coefficient of variation)? ° 


k 


Interpretation is aided by Tables I and II. These tables are possible 
because the distribution of the ratio of any multiple of x,” to the aver- 
age value (of that multiple) depends only on k. Thus, if k = 4, individual 


TABLE I 
Distribution of quantities which are distributed as fixed multiple of chi- 
square. Ratios of individual value to its average value exceeded with 
given probabilities. 














Degrees of freedom Barceeded Dy 0e of Beene 2d % of Exceed by 10% of 
1 0.016 0.46 2.71 
2 0.10 0.70 2.30 
3 0.19 0.79 2.08 
4 0.26 0.84 1.94 
5 0.32 0.87 1.85 
10 0.49 0.93 1.60 
20 0.62 0.96 1.42 
30 0.69 0.98 1.34 
40 0.73 0.98 1.30 
50 0.75 0.99 1.26 
100 0.82 0.99 1.18 
200 0.873 1.00 1.139 
500 0.920 1.00 1.081 
1000 0.9438 1.00 1.057 





Examples: (1) If the long run average is 10 volts?/eps, then among estimates 
with 10 degrees of freedom, 10 per cent would fall below 4.9 
volts?/eps, and 50 per cent would fall above 9.3 volts?/eps. 

(2) If a single observed estimate, with 5 degrees of freedom, is 
observed to be 2 volts?/cps, then we have 80 per cent confidence 
that the true long-run value hes between 2/1.85 = 1.08 
volts?/eps and 2/0.32 = 6.25 volts?/eps. 
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Taste IJ] — Beuavior or x;2 ON DECIBEL SCALE 





& required for intervalt of spread 




















denction st Spread* of intervalf in dbf eee 

10 db 5 db 2 db 1 db 
40% an. 6/Vk — 1 1 3 11 42 
60% - 10/Vk — 1 2 5 28 105 
80% 16*/VW/k — 1 4 11 63 250 
90% 200/Vk — 1 5 18 104 410 
96% 25/Vk — 1 8 27 161 640 
98% 29/Vk — 1 10 34 207 . 820 





* Accurate to nearest integer in numerator for k 2 4, except for 80 per cent, 
where 16 should be replaced by 15 for k S 11. Based on Tukey and Winsor. 13 
(Spread is the difference between the upper boundary expressed in db, and the 
lower boundary expressed in db.) 

f All intervals are symmetric in the probability sense, half of the non- included 
probability falling above and half below the interval. 

t Since we are dealing with measures of variance, analogous to power, 10 a= = 
(factor of 10), and (number of db) = (10 logio ratio of variances). 


values of any particular multiple of x,’ will, in the long run, fall below 
0.26 times their average value in 10 per cent of all cases (will be 5:8 db 
or more below average in 10 per cent of all cases). Similarly, individual 
values will, in the long run, fall below 0.84 times their average value (be 
0.7 db or more below average) in 50 per cent of all cases, and in 90 per 
cent of all cases will fall below 1.94 times their average value (be 2.9 db 
or less above average). Thus, in the long run, 80 per cent of all values 
will fall in an interval of spread (2.9) — (—5.8) = 8.7 db. 

Thus, for example, to obtain 4 chances in 5 that a single observed 
value will lie within +30 per cent of the true value we require (see Table 
I) about 40 degrees of freedom, while to obtain 4 chances in 5 that a 
single observed value will lie in a prescribable interval of length 5 db, 
we require (see Table IT) at least 11 degrees of freedom. 

The results of the preceding section indicate that, for an estimate .of 
smoothed spectral density, when P(f) is smooth, the number of degrees 
of freedom is given by 


W. 

Af” 
where the latter form expresses the number of degrees of freedom as the 
number of elementary frequency bands, each of width 


1] 
Af = 597 > 


k = 2T.W, = 


contained in the equivalent width W, . 
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For design purposes, the relation of the last section (including the 
small safety factor) indicates that 


= 2 = gy 2s 
[var [Pifi)Vlave (PAP ~ Tn 


when P(f) varies slowly. (This will usually be the case if (¢) k > 3 or 4, 
say, and (27) Pn(f) is a moderately smooth single hump. For, under these 
circumstances, Pi(f) will not change rapidly in a frequency interval 
1/T;, and the same property can then be inferred for P(f) itself.) 

When, on the other hand, P(f) consists of a single sharp peak, we 
find, using the last result of Section 7, that k = 2, so long as 1/7, . 
is not small enough to be comparable with the width of the peak. At 
first glance, this result may appear a little surprising, but when we 
notice that a single spectral line corresponds either (a) to frequency 
+f and to frequency —fo , or (b) to cos wot and to sin wot , or (c) to ampli- 
tude and to phase, it appears quite natural that a sharp line carries two 
degrees of freedom and not merely one. 

We may summarize the semi-quantitative study of the stability of 
estimates of the smoothed power spectrum as follows: 

(a) It is not necessary to judge stability with very high accuracy. 

(b) It is convenient to measure stability by analogy with the number 
of degrees of freedom associated with a multiple of a chi-square variate. 

(c) The equivalent number of degrees of freedom can be regarded as 
the number of elementary bands of width Af in the equivalent width W, 
of the filtered spectrum 


2Pa(f) = Ad fi)-2P(/)  (f 2 0) 


if the result is not too small (say > 3 or 4) and Pa(f) is moderately 
smooth. 

(d) If the filtered spectrum approaches a single sharp peak, the 
equivalent number of degrees of freedom for the corresponding estimate 
approaches two. 

In interpreting the concept of equivalent number of degrees of freedom, 
it may be helpful to imagine the continuous density of the filtered spec- 
trum replaced by a discrete set of ordinates, one per elementary fre- 
quency band. If these ordinates are po , p1 , po, °*-, the natural approxi- 
mation to the number of degrees of freedom is 


pis (po + pi + po + +)? 
po + pi? + po? + he 


as illustrated in Fig. 4. This approximation will usually be satisfactory 


MEASUREMENT OF POWER SPECTRA 211 


Hoste { tacase ft 





ACTUAL POWER POWER AT SPECIFIC 





SPECTRUM ~ FREQUENCY ~~~ 


oi ; 
S| ATT 
MUTE 1 Yrtres TATU) 


FREQUENCY —> eae ed 









‘4 _ 





ELEMENTARY cecaueAee aang BAND ae Tey BAND 
FREQUENCY YIELDING 7 DEGREES YIELDING 5.55 DEGREES 
BANDWIDTH OF FREEDOM OF FREEDOM , WHERE 
2 
5.55 = (0.5+1.041.7+2.3+3.14+3.4 +3.7 
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Fig. 4 — Equivalent degrees of freedom. 


as long as the effect on & of moving each ordinate around within its 
elementary frequency band can be neglected. (In more extreme cases, 
an approximation based on two ordinates per pair of elementary fre- 
quency bands is more precise.) 


10. DIRECT ANALOG COMPUTATION —- GRADED DATA WINDOWS © 


We have been dealing thus far with continuous time, and the com- 
munications engineer will naturally ask, ‘‘Why introduce autocovariance 
functions and all that, why not measure the spectrum by filtering, recti- 
fying, and smoothing?”’. The only fair answer.is ‘‘By all means, do so if 
you can obtain, and maintain, the necessary accuracy economically!” 
Let us apply our results to such a measurement technique. 

Let X(é) be the noise or signal whose power spectrum P(f) we wish 
to study. Let us pass it through a filter of transfer function Y(f), and 
designate the result by Xout(t). Its power spectrum, Pou(f), will be given 


by 
Powlf) = | Y(f) PPC) 
and if a section of Xour(t) of length 7, is applied to an ideal quadratic 


rectifier and smoothed by a smoothing circut of infinite time constant, 
the result will be 


i ” Exve(OF at. 
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The average value of this result divided by T,, is 


i ” 2Poulf) af, 


and the number of equivalent degrees of freedom is the number of 
elementary frequency bands, of bandwidth 1/(27,), contained by the 
equivalent width of | Y(f) |’-P(f). This last function is of the form 


(power transmission function) (original power spectrum) 


just as before. We see that the ideal process of filtering, rectifying, and 
smoothing the actual input has produced the same accuracy as the ideal 
process of calculating, modifying, and transforming the apparent auto- 
covariance, provided that | Y(f) |? = H.(f; fi) for a suitable choice of 
Tm,jf, and f; . This is what we ought to have expected, since we believe 
that either method extracts nearly all the information about the spectrum 
which the data provides. 

A few practical considerations deserve mention. They center around 
the actual switching sitations which can arise, especially when we have 
only a finite sample of the original noise. In Fig. 5, the watt-second meter 
includes quadratic rectification and integration functions which we think 
of as ideal. (It may be very important to allow for the fact that the 
“round” position of switch a is not quite at the same potential as the 
zero of the input noise, but we shall neglect this effect for the moment.) 

Some four sorts of operation can arise according to the times at which 
switch B is operated. The watt-second meter may be connected either at 
the beginning of the running period 7 or after some interval of time 
(to allow initial transients to become negligible), and may be discon- 
nected either at the end of the running period T or after some interval of 
time (to allow the meter to reach a maximum). These four modes of 
operation are illustrated in Fig. 6. 

In Mode I, providing the initial waiting period is long enough to allow 
transients to become negligible, the filter output is essentially stationary, 
and the earlier discussion in this section applies. 


STATIONARY . 7 
RANDOM WETER 
PROCESS A B 

: O FILTER O 
DUMMY — 
LOAD 


Fig. 5 — Schematic analog circuit. 
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Fig. 6 — Time histories of operation for different modes. 





In Mode II, all of the energy output is recorded on the meter, but the 
reading is divided only by the length of the input data. This mode is 
amenable to exact and complete analysis which is given in some detail 
in Section B.10. The results differ from those of Mode I in that the 
transform of the boxcar function of length T (running period) is con- 
. volved twice into the spectral window. (Convolution is defined and dis- 
cussed in Appendix A.3.) If 7’ is not large, the effects may be somewhat 
uncomfortable in that the spectral window becomes wider and more 
ragged. 

Mode III, discussed briefly in Section B.10, differs from Mode II 
by an additional convolution whose effect again disappears as T’ —> ©. 

Mode IV resembles Mode I in that the noise input is passed through 
the filter until transient effects have become negligible, when the meter 
is switched on at the filter output. It differs from Mode I in that the 
meter is read after a final waiting period. This seems to offer no advan- 
tages over Mode I, and will not be discussed further. 

The contrast between Mode I and Mode II is another example of what 
should now be becoming familiar. Mode I has no additional convolution 
in the spectral window. Mode II provides data economy by making it 
possible to integrate over the whole length of the available record. We 
should really like both advantages. 

We can, indeed, obtain most of both advantages, but only by replac- 
ing the sharp edges of the switched data window by the smoothed outlines 
of a graded data window. In other words, we need to introduce 
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Xin(t) = BY -XM 


at the input of the filter, where B(t) vanishes except for 0 < ¢ < 7’, and 
is smooth enough to have its Fourier transform /(f) concentrated near 
f = 0. Details are discussed in Section B.10. 

Difficulties arising from the fact that the zero of the X(é) input might 
not be at ground are shown in Section B.10 to behave similarly to 
those arising from switching transients, namely, no effect in Mode I, 
possibly uncomfortable in Modes II and III, usually negligible when a 
well-chosen graded data window is used. 

Another device is sometimes used to make maximum use of a finite 
noise record. The record is merely closed into a continuous loop, and the 
rectifier-smoother output averaged. It is shown in Section B.10 that 
here, too, we must use a graded data window B(t). 


11. DISTORTION, NOISE, HETERODYNE FILTERING AND PREWHITENING 


Another group of very important practical considerations center 
around the spectrum of the “signal” as it is handled (either instan- 
taneously, or in recorded form). We have spoken of “filtering, rectify- 
ing and smoothing” and have treated all these steps as ideal. No atten- 
tion has been given to the equally vital “gathering” and ‘transmission 
and recording” steps. Tacitly, they too have been treated as ideal. 
Realistically, we must expect a certain amount of distortion (non- 
linearity, intermodulation, etc.) and the addition of a certain amount of 
background noise in all three of the first steps: gathering, transmission 
and recording, filtration. It often proves to be most important to lessen 
the ill effects of such distortion and noise addition. 

In a perfect system, and with a fixed spectral window, the fluctuations 
of an estimate are proportional to its average value. If we have a fixed 
uniform noise level, it will do the least additional damage if all the 
average values of the estimates are of about the same size, for then no 
low estimate can “disappear” into the noise. 

Intermodulation distortion will have the greatest effect on the signal 
being transmitted when two strong frequencies combine to produce a 
modulation product whose frequency falls in a very weak region of the 
spectrum, for it is in such situations that the fractional distortion of the 
spectrum reaches its maximum. To minimize possible effects of intermod- 
ulation distortion it is again desirable to transmit, record and generally 
handle signals with a roughly flat spectrum. 

To these noise and intermodulation considerations another sort of 
consideration may be added. Many frequency analyzers use a hetero- 
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dyne system, bringing the frequency band to be studied to a fixed filter, 
rather than tuning a filter across a wide frequency band. The power trans- 
fer function of the combination of heterodyne modulator and fixed 
filter, referred to input frequency, will depend only on Af, the deviation 
of | f | from | fo |, where fo is the nominal frequency of the fixed filter, and 
will be denoted by Q,(Af). If demands at different frequencies differ, 
the shape of Q,;(Af) may have to be a compromise. One sort of demand 
arises when P(/) varies very rapidly. The net contribution near frequency 
fi to the average value of the spectral density estimate is measured by 
H(f; fi) P(f), where, as elsewhere, H,(f; fi) = Qf + fi) + Qf — fr). 
If our estimate is to be useful, only f’s near f; should have a substantial 
net contribution. If P(f) rises steeply as f leaves f;, we may have to 
requirea very rapid fall-off in H,(f; fi), here practically equal to Q;.(f — fi), 
in order to attain this as f leaves f;. We may thus be forced to compro- 
mise properties of Q,(Af) useful near other frequencies. The simplest 
way to avoid such problems is to arrange for the P(f) of the “signal” 
analyzed to be fairly constant, or at most slowly varying. 

Thus, for a variety of reasons, we can often gain by introducing ‘‘com- 
pensation” or “preemphasis” to make more nearly constant the spec- 
trum of the “signal” actually transmitted or recorded, and analyzed. 
Since the ideal would be to bring the spectrum close to that of white 
noise, it is natural to refer to this process as prewhitening. Such flattening 
of the spectrum need not be precise, or even closely approximate. We 
need only to make the rate of change of P(f) with frequency relatively 
small. 

Because of advantages related to the noise and intermodulation dis- 
tortion introduced in various steps of the sequence, it will be best, other 
considerations aside, to carry out such prewhitening at as early a point 
in the measurement-analysis sequence as possible. Sometimes this can 
even be done in the pick-up or sensing element. 

This whole philosophy of prewhitening, which appears quite natural 
to the communication engineer familiar with preemphasis and other 
techniques for increased information transfer within a given frequency 
interval, comes as a great change to the instrumentation engineer, whose 
clients ordinarily require “faithful” reproduction of an input at the out- 
put, by which they mean phase shifts nearly linear with frequency, and 
a nearly constant amplitude response up to some high frequency. It will 
be rare indeed, in practical spectrum analysis, that the ideal response 
for the initial transducer and amplifier will be flat. Instead it should 
have a characteristic contributing to prewhitening. This characteristic 
will, of course, have to be measured separately and the corresponding 
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adjustments to the estimates of the spectral density will have to be made 
so that these estimates, instead of applying to the “signal” actually 
analyzed, apply to the original “input signal”, but such labor will often 
be many times repaid. 

One further consideration about frequency responses in measurement 
now enters naturally. In almost every power spectrum problem there is 
an upper frequency beyond which there is no appreciable interest. In 
most components used in measurement, transmission, recording, etc., 
the noise level, and often the level of intermodulation distortion, is 
roughly a fixed fraction of the peak useful level. If substantial power is 
present at frequencies so high as to be uninteresting, then the need to 
keep total power below the peak uscful level forces us to handle the 
interesting frequencies at a power level below that which could other- 
wise be used. The ratio of noise and intermodulation distortion to inter- 
esting signal is thus raised — the quality of the analysis and its results 
degraded. The appropriate remedy is to filter out the uninteresting high 
frequencies at as early a stage as possible. This is a further reason why a 
carefully tailored frequency response is an important part of a power 
spectrum measuring process. 

Together with the need for adequately wide filters (we can of course 
use narrower filters when we are prepared to average over homogeneous 
records of sufficiently long total duration) to provide enough equivalent 
degrees of freedom, and hence enough stability for the estimates, this 
tailoring of frequency response is often the crucial part of a power spec- 
trum measuring program. Indeed, there may sometimes be no reason- 
able way to measure power spectra with an ill-tailored frequency re- 
sponse, even if this response be “‘flat’’. 


EQUALLY SPACED REcORDS 


We come now to treat a modified situation of great practical impor- 
tance, where the observations are used for analysis only at equally spaced 
intervals of time — not as a continuous time record. Two new and im- 
portant features enter: there is aliasing of frequencies, and practical 
analysis will involve digital rather than analog computation. In general, 
however, the situation is surprisingly similar to the case of a continuous 
record, with limitations on data-gathering effort still forcing us to com- 
promise resolution and stability. Advantages of convenient calculation 
and noise reduction still lead us to prewhitening. Filtering of equi-spaced 
data must involve transversal filters (see Glossary of Terms for defini- 
tion) whose transmission properties (in frequency) exhibit a periodic 
symmetry. This exerts additional pressure toward prewhitening. 
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Questions regarding computational techniques arise anew because 
of the nature of digital computation. These include means for reducing 
the effects of a displaced (perhaps drifting) zero, smoothing by groups 
to economize arithmetical operations on the whole, and preliminary 
rough estimation as an aid to planning. 


12. ALIASING 


We now suppose that X(¢) is available, or is to be used, only for uni- 
formly spaced values of ¢, which we may as well suppose to be 


t = 0, At, 2At, 3At, ---, nAt, 
so that C(7) can only be estimated for 
| 7] = 0, Ad, 2Aé, +--, nAt. 


Now, the equations 
CG= i: 2P,(f)-cos Qafr-df, 
0 


| 7] = qdt, q=0,1,-:-,”, 


if soluble at all, can always be satisfied by a P4(f) which vanishes for 
f > fw = 1/(2A8), although the power spectrum P(f) of the original 
process (for which the C(r) was defined) might actually cover a much 
wider frequency range. (We shall reserve the notation P4(f) for such a 
function, vanishing for |f| > fw.) While frequencies between f = 0 
and f = fy are clearly distinct from one another, we face a problem of 
aliasing, since frequencies above fy usually contribute some power. Each 
frequency, no matter how high, is indistinguishable from one in the 
band from 0 to fy . 

The essential, unavoidable nature of this problem is made clear by 
Fig. 7 which illustrates how equally spaced time samples from any 





Fig. 7 — Sampling of sinusoidal waves. 
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cosine wave could have come from each of many other cosine waves. 
(The familiar stroboscope uses a particular expression of this fact in 
apparently “slowing down” rapidly rotating or oscillating machinery.) 

The logical position about P(f) depends very much on whether X(t) 
is thought of as having any real existence for | ¢| ~ qAt. 

If X(t) really exists for continuous é, although we have (7) failed to 
observe or record it, or (27) failed to ‘‘read”’ the record, or (777) decided 
to neglect the available values, then there is a well-defined P(f) cor- 
responding to the process from which each X(t) is a sample, and we must 
be very careful about the relation between P(f), which is our true con- 
cern, and P4(f), which is clearly all we can strive to estimate directly 
from the data. It can be shown (see Section B.12) that, in the form 
appropriate for a one-sided spectrum, if we set 


2P.(f) = 2P(f) + 2PQfn — f) + 2P(Q2fy + f) 

+ 2P(4fy — f) + 2P ty +f) +o 
then we may take 
P.(f); OS IF Stuy 


: otherwise 


P,(f) -| 


where fy = 1/(2Aé) is the folding (or Nyquist) frequency. We naturally 
call the frequencies f, 2fvy — f, 2fv + f, 4fv — f, 4fv + f, and so on, 
aliases of one another, f being the principal alias. The aliased spec- 
trum P.(f) is the result of aliasing P(f). The principal part of the 
aliased spectrum P.a(f) is the part of P.(f) which corresponds to 
principal aliases, positive and negative. 

(If X(t) has no natural existence for ?¢’s which are not integral multiples 
of At, then P(f) is not uniquely defined, and we are at liberty to choose 
any normalization we desire. In particular, we may decide to limit P(f) 
to the interval | f | S$ 1/(2Ad), in which case we will be enforcing P(f) = 
P.4(f) without any trace of aliasing. We mention this case for logical 
completeness, but remark that it seems to occur infrequently in practice, 
whatever the field.) | 

If the Gaussian noise we are considering has a power spectrum P(f) 
which extends outside |f| < 1/(2Aé), then the Gaussian noise with 
spectrum P4(f) is not the same for continuous time. However, if we con- 
sider these two noises only for equi-spaced times 


t = 0, Al, 2At, ----- 


they are identical. For all first moments vanish and all second moments 
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coincide, which implies coincidence of the joint distributions of any 
finite set from ---, X_,, ---, X-1, Xo, X1, °---, Xq, °°+, and this is our 
definition of the coincidence of two noises. (If a result concerning such 
equally spaced values can be established for a Gaussian noise restricted 
to have P(f) vanish outside | f| S 1/(2Ad), it must trivially hold, under 
the same restriction, when all occurrences of P(f) are changed to Pa(f). 
It is a consequence of the identification just established that the result, 
when expressed in terms of P4(f), must also hold for any Gaussian noise 
whatever.) 

The frequency interval from 0 to fy contains a certam number of 
elementary frequency bands in the sense of our treatment of variability. 
The total length of record is T, = nAt, and if we write T,, = n’At for 
the effective length, then, since 


a2 
a es Se = 2At = n’ 
elementary frequency bandwidth 1 

Hd i 


there are n’ elementary frequency bands between 0 and fy . As a statisti- 
cian would have anticipated, we gain one elementary frequency band — 
one degree of freedom — for each added observation. 

It is perhaps natural to base a hope on this result —a hope that 
taking data more frequently over the same time interval would gain us 
many degrees of freedom and reduce our difficulties with variability. 
However, this is not the case (as the expression for the width of an ele- 
mentary frequency band 1/(27;,) should have warned us). Taking ob- 
servations twice as frequently yields twice as many elementary fre- 
quency bands, but also doubles the folding frequency fy and, thus, 
doubles the frequency interval occupied by principal aliases. The density 
of elementary frequency bands is not increased one iota. (Clearly, iota 
was the Greek word for bit!). 

It is usual for aliasing to be present and to be of actual or potential 
importance. This is an inescapable consequence of data taken or read 
at uniform intervals. (It is not infrequently suggested that there should 
be a workable scheme of taking discrete data in some definite, but not 
uniformly spaced pattern, and estimating the power spectrum without 
aliasing. No such scheme seems so far to have been developed). 


13. TRANSFORMATION AND WINDOWS. 


Given uniformly spaced values of X(t) — values which we shall now 
designate Xo, X1, --+, Xn, as well as X(O0), X(Ad), ---, X(nAt) — we 
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expect to calculate “sample autocovariances”, modify them, and then 
Fourier transform the results. There is no possibility of calculating auto- 
covariances for lags other than 0, Al, ---, nAt, and so we may as well 
write Co, Ci, ---, Cm in place of C,(0), C;(At), ---, Ci(mAt). Tf we 
Fourier transform these m + 1 numbers, as obtained or modified, we 
might obtain a smoothed spectral estimate for any frequency between 
0 and fy = 1/(2A¢) that we may wish. It is not surprising, however, that 
we lose no information (and little explication) if we calculate only m + 1 
such estimates (one for each C,). Nor is it surprising that we regularly 
take these estimates equally spaced over 0 S f S fw, and hence at 
intervals of fy/m = 1/(2mAt). As a consequence we have to deal with 
finite Fourier (cosine) series transformation (classical harmonic analysis) 
rather than with infinite Fourier integral transformation, but the cor- 
respondence between multiplication and convolution persists. 

The question of modification also requires discussion. In the continu- 
ous case we Fourier transformed 


Cr) = Dilr)-Coo(r) = Dilr)-Co(r) 


where Co(7) coincided with Coo(r) wherever the latter was defined, and 
is zero otherwise (cp. Section B.5). The result was, consequently (e.g. 
see Appendix A.3), the convolution of the Fourier transforms of D;(r) 
and C)(r). So long as time was continuous and computation was pre- 
sumably by analog devices, there was a real advantage to modification 
before transformation. Now that time is discrete and computation pre- 
sumably digital, the advantage is transferred to first transforming and 
then convolving. Indeed, because the D;(r), for 7 > 1, are finite sums of 
cosines, so that their transforms are simply sums of spikes (Dirac delta- 
functions) at the appropriate spacing, convolution means only smooth- 
ing with weights 


0.25, 0.5, 0.25 (¢ =2, hanning) 
0.23, 0.54, 0.23 (¢ = 3, hamming) 


and is very simply carried out. 

In discussing this program, we gain some generality by using m + 1 
lags separated by Ar = hAt for an integer h > 0, while our results are 
no more complicated than if we were to confine ourselves to h = 1, 
which is the practical case. Thus, we first compute the mean lagged 
products 


1 q=n—rh 


Cy Sh p> Xq*Xqtrh 


MEASUREMENT OF POWER SPECTRA 231. 


forr = 0, 1, 2, ---, m, where mh < n. Note that C; is heuristically as 
close as we can come to the apparent autocovariance Coo(--rAr) with the 
available (equi-spaced) data. Note further that, so far as functions of 
the C;, are concerned, our effective folding frequency is 


iL 1 
fr* = PAr = > fn. 


Ar Ah 
We will usually need to adjust the C, somewhat to improve very-low- 
frequency performance, as discussed in Section 19, but this need not 
concern us for the moment. 
Applying a discrete finite cosine series transform to the sequence Cy , 
Ci, +--+, Cm, we find 


m—1 
V, = ar-| oO C,: cos um + Cr Cos rx| ; 
gq=1 


(We may regard this as arising from replacing Co(7) in the expression for 
P.(f) as its Fourier integral transform by a finite sequence of spikes 
(Dirac delta functions) of intensities (areas) proportional to the corre- 
sponding values of Co(r).) If we put 


r 
Pos (a) ae 


then it is shown in Section B.13 that 
ave {Poa(f)} = [ Qo(f — f’;Ar)-P(f’)-df’ 
where 


Qo(f; Ar) = Ar-cot oot sin mwAr. 
In terms of Qo(f), which is treated in Section B.5, we have 


Qo(f; Ar) = 2; Qo (7 sf, 2) = Qoa(f). 
q=—%0 Ar 
Just as the average value of Po(f) in the continuous case is the cor- 
responding value of Qo(f) * P(f), so here the average value of Poa(f) is the 
corresponding value of Qo(f; At) *P(f). Thus, we may consider Poa(f) as 
estimating the result of ‘‘smoothing” P(f) with a window Qo(f; Av) which 
has repeated major (and concomitant minor) lobes at intervals of 
2fv* = (Ar) ’. This is not the most convenient way to consider matters, 
and in Section B.13 it is shown that there are two equivalent forms for 
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ave {Pya(f)} and, correspondingly, two other, equally appropriate, ways 
to consider the situation. 
These arise from the three-fold identity 


Qoatf) *P(f) = Qf) *Palf) = Gaal) # Pal), 


any member of which represents the average value of Pos(f). Thus, we 
can also consider Poa(f): (i) as estimating the result of smoothing the 
infinite, periodic aliased spectrum P,(f) with the same window as for 
the continuous case, or (ii) as estimating the result of smoothing the 
principal part of the aliased spectrum Pa(f) with the aliased window 

Qoa(f). The latter choice is usually the most helpful of the three possi- 
- bilities, and is the one we shall adopt. 

All this has been discussed for the immediate results of transforming 
unmodified C,’s. This is only the case 7 = 0 of the identity 


Qia(f) * PS) = Qf) *Paf) = Qia(f) * Pal) 


which holds in general. We should thus usually be concerned with Q:a(f) 
and with P4(f). 

The case 7 = 2 (hanning) corresponds to the following smoothing after 
transformation: 


Uo = 0.5 Vo + 0.5 Vi ) 
U, = 0.25V.1+05V,4+025 Vi, Lsrsm—1l, 
Un = 0.5 Vna + 0.5 Vn, 


I 


I 


for which Qe4(f) has the form shown in Fig. 8. The curve is for m = 12, 
and the circles are for m = , which corresponds exactly to the con- 
tinuous case. Clearly, for usual values of m, the modification in the lobes 
due to aliasing is almost surely unimportant. 

The frequency separation between adjacent estimates is 


sw 
2Tm  2mAr’ 





but the equivalent width of the windows (for 1 S$ r S m — 1) is about 


1.30 _ 130 
Ts mAr’ 


just as for the continuous case (see Section 8). For most purposes we 
may again take the bandwidth corresponding to each estimate as 1/7'n , 
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Fig. 8 — Aliased spectral window Qe, for m = 12. 
so that m satisfies 


- = (bandwidth of estimates) - (Ar). 


If we had neither modified before Fourier transformation, nor 
smoothed after transformation, we should have faced the uncomfortable 
minor lobes of Qoa(f) shown in Fig. 9 for m = 12 (with circles form = «). 
Generally speaking, all we learned about desirable lag windows for the 
continuous case carries over with minor modifications, at most. The 
only serious effect of going to uniformly spaced values is the aliasing 
(and this may be very serious indeed). 

It is well worth noting that the possible spectral windows Qi4(f) are 
now restricted to be finite Fourier series in cos wAr, cos 2wAr, ---, 
cos mwAr, or equivalently, to be polynomials in cos wAr of degree m at 
most. 


14, VARIABILITY AND COVARIABILITY 


We now extend all our other notation: H,(f; fi), P:(fi), ete. to cor- 
responding H;a(f; fi), Pias(fi), ete. for the uniformly spaced case as 
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Fig. 9 — Aliased spectral window Qoa for m = 12. 


specified in Sections B.13 and B.14. It is shown in the latter section that 
we now have 


cov {2Pia(fi), 2P3a(fo)} = [ iat; ft) Ayah; fo) 20 al f) df 


where 
as j sin wT, \" (‘sin w'At\~ 
Taf) x4 | Pall + Pals! — f(a) 7 ) df’, 
- co or. oy At 
(a! =. 2zxf’), with a very slightly different determination of 77, than be- 


fore. The only essential change has been the introduction of a new 
factor, corresponding to aliasing, 


(= oF 

w’At 

into the integrand of the power-variance spectrum T,,(f). For usable 
values of 7, this factor will vary much more slowly than 


. , 2 
sin or) 
oT" 
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and can usually be treated as sensibly equal to unity. All the approxi- 
mate analysis of covariability and variability given for the continuous 
case now goes through without essential change. 


15. PREWHITENING 


If the equally spaced data is sampled from a continuously transmitted 
“signal” or “read” from a continuous recording, then all the points 
made in Section 11 in favor of early prewhitening are still applicable. 
If the equally spaced data arises more directly, as by photographing a 
physical situation, we may not be able to apply prewhitening early. In 
either case it may still be desirable to prewhiten after the data is obtained 
at equal intervals, either as a supplement to, or as a partial replacement 
for, early prewhitening. 

The average value of a power density estimate Pj4(fi) is 


ave {Pis(fo} = is Pia f)-df, 


where 
Pinlf) = Hia(f; fr): Palf). 


We want this quantity to tell us about the values of P(f) for f near fi. 
To do this we must: (i) reduce variability, (ii) ensure that Pu(f) re- 
sembles P(f) sufficiently, and (iii) concentrate P;ai(f) near f = fi. We 
must be concerned with: (i) adequately broad windows, (ii) sufficiently 
weak aliasing, and (iii) enough sharpness in the effective filter. This 
sharpness can be obtained in a combination of ways. 

Note that we asked for Piai(f), which measures the net contribution 
to the average value, to be localized. We did not merely ask that Hia(J; fi) 
should be localized. For, if 


Pa(fe) >>> Pah), 
although 
His(fos fit) << Hiahi 3 fi), 
it is still possible for 
Pialfe) = Hiafe 3 fr): Pfr) 
to outweigh 
Piaf) = Hilf 3A) Pah), 


so that our estimate tells us more about P(f) near f = fo than it does 
about P(f) near f = f;. To avoid such unfortunate situations either we 
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must choose our window pair in a very particular manner (so as to make 
Hja(fe 3 fi) exceptionally small) or we must avoid Pa(f2) >>> Pa(fi). 
Both courses are possible and sometimes necessary. Usually, the second 
course is simpler. 

- Following the second course is simple in principle. Given actual values 
X,, we apply a selected linear procedure to obtain new values X, and 
analyze these. The aliased spectrum P.(f) of the X, differs from the 
aliased spectrum P,(f) of the X, by a known multiplicative function of 
frequency. (See Section B.15 for details.) Thus, (i) we may convert 
estimates of P4(f) into estimates of P4(f), and (ii) we may choose the 
linear procedure to make the aliased spectrum P4(f) of the X, reasonably 
flat. 

The simplest linear procedures are probably the formation of moving 
linear combinations and the construction of autoregressive series. A 
simple example of a moving linear combination is 


ie = Xs oa aX o-1 = BX q-2 —= VX o-3 
for which the relation between the spectra is 
P,(f) P(f) —iwAt —i2wAt —iswAt \2 
——- =. = /|1 — ae =e — ye 
Pad) PG) | 
= a cubic in cos wAl. 


A suitable moving linear combination will generate any desired non- 
negative polynomial in cos wAt. 
A simple example of an autoregressive combination is 


x = Be + iO, me —E pXq-2 = vXq-3 


for which the relation (reciprocal to that just considered) between the 
spectra is 





Pay PE) nee pete yg PHBE yg Babe 2) 


= (a cubic in cos wAt)™. 


A suitable autoregressive combination will, when indefinitely continued, 
generate the reciprocal of any desired non-negative polynomial in 


cos wAt. 
By combining a suitable moving linear combination with suitable auto- 


regression, as for instance in 


Xa = oF — aX q1 + Gee 
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which may also be written 


XxX, comes ee = X q pie aX¢1, 





for which 
Pf) _ PU) _ | page 
Paty PO) ee ae 





—_ lta’ — 2a cos wAt 
1 + dA? — 2d cos wlt 





= a rational function of cos wAt, 


we can modify Pa(f) by multiplication by an arbitrary non-negative 
rational function of cos wAt. 

Freedom to multiply by any (simple) non-negative rational function 
of cos wAt is very substantial freedom. If we have a rough idea (see Sec- 
tion 18) of the behavior of P4(f), and if this behaviour is moderately 
smooth, though perhaps quite steep in places, we can usually do a very 
good job of flattening the spectrum by prewhitening after obtaining dis- 
crete (digital) values. Unless still bothered with steep slopes, we will 
usually then find that hanning, with its (0.25, 0.50, 0.25) weights and 
lower outer lobes is slightly preferable to hamming, with its (0.23, 0.54, 
0.23) weights and reduced first minor lobes. 

The main purpose of prewhitening after data has been obtained in 
digital form at equally spaced intervals is to avoid difficulty with the 
minor lobes of our spectral windows. We may regard the whole process 
of prewhitening, analysis with standard spectral windows, and, finally, 
compensation of estimate, as a means of constructing a set of specially 
shaped spectral windows, one for each center frequency, specially adapted 
to the data we are processing. This point of view is illustrated in Tig. 10. 
The uppermost curve shows the power transfer function of a hypothetical 
prewhitening filter, one which enhances mid-frequencies in comparison 
with those lower and higher. The next line shows two standard spectral 
windows, with symmetrical side lobes. The third line shows the effective 
spectral windows when prewhitening is followed by standard analysis, 
as given by the product of prewhitening power transfer function and 
spectral window. In either case, the side lobe toward mid-frequencies is _. 
higher than the corresponding side lobe on the opposite side, which is 
lower than for the standard. The lowest curve shows alternative spectra 
for time series which might reasonably be processed by the combination 
of prewhitening and standard analysis shown (since the prewhitened 
spectra would change only slowly). In every case, the side lobes of the 
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special spectral windows are automatically so related to these spectra, 
as to balance and reduce the amount of leakage through them, as given 
by the product of special spectral window side lobe and original spectral 
density. 


(6) 

(a) 
ie) 

(b) 
ie) 





(d) 


Fig. 10 — Illustration of prewhitening; (a) prewhitening power transfer func- 
tion, (b) standard spectral windows, (c) effective spectral windows, and (d) typi- 
cal input spectra to which (a) might be applied. 

Easing of requirements for accuracy (number of significant figures, 
etc.) during computation are ordinarily quite secondary, though pleas- 
ant, advantages of prewhitening during digital calculation. 


16. REJECTION FILTERING AND SEPARATION 


If the difficulties in handling P(f) are due, wholly or in part, to one or 
more quite narrow and very high peaks (‘‘ines” or “narrow bands’’) 
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then we cannot expect either to afford, or to be able to estimate, the 
great number of accurately chosen constants which would be required to 
obtain a rational function whose reciprocal has a shape very close to the 
given narrow peak. We must adopt a slightly different approach, and 
plan to make at least two analyses of the data — one to estimate the 
behavior at the peak, and another to estimate the behavior away from 
the peak. 

In order to separate the bulk of the information in the data from the 
variation associated with the sharp peak which may be troubling us, we 
may apply to the data a moving linear combination (possibly combined 
with autoregression) whose power transfer function (the factor by which 
the spectrum is altered) has one or more zeroes near the peak. The 
resulting sequence will be largely free of contribution from the peak and 
hence will be suitable for further prewhitening (if required) and analysis. 
(This operation can often, of course, be combined with further prewhiten- 
ing so far as actual calculation goes. It will of course be necessary to 
compensate for the effects of this transformation at frequencies away 
from the peak, when preparing the final spectrum estimates for interpre- 
tation.) 

There remains the estimation of the power in the peak, and possibly 
some inquiry into its width. A number of approaches are possible: 

(1) We may analyze the original data as well as the data with the 
peak rejected, obtaining an estimate at the peak and possibly confirma- 
tory estimates far from the peak. 

(2) We may subtract a suitable multiple of the modified data from the 
original data so as to retain the peak and partially reduce other fre- 
quencies; and then analyze the difference. 

(3) We may apply a band-pass filter to isolate frequencies at and near 
the peak, and then analyze the result. 

Any of these techniques may be applicable in suitable circumstances. 

Other related procedures are sometimes more natural than the use of 
moving linear combinations. Rejection of zero frequency, for example, 
is more naturally, and computationally more easily, accomplished by 
subtraction of the mean of all the data from each X, than by the sub- 
traction of a moving linear combination from each. 

Rejection filtration has been applied in oceanography by Groves,’ 
Seiwell,” Seiwell and Wadsworth,” to the elimination of various well- 
defined tides from records. It almost always has to be used to eliminate 
possible peaks at zero frequency (see Section 19 below). 

In electronic measurements we may also anticipate its possible use in 
measurements: (i) close to a substantial harmonic of 60 cycles per 
second (such as 120 eps or 1380 eps), or (ii) near some strong “carrier’’. 


4 
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17. SMOOTHING BY GROUPS 


The cost of digital power spectrum analysis, once initial investments 
in programming, etc. have been made, and assuming records to have 
already been made and ‘‘read’’, is likely to be associated with the number 
of multiplications involved in computing the mean lagged products (in 
original or modified form). If there are n observations, and m lags are 
used, then there will be roughly nm multiplications. 

Ways of reducing this number substantially are naturally of interest. 
Most of these must depend for their efficacy on our interest in something 
less than the whole spectrum. We have already discussed (in passing) a 
situation which would naturally arise only when we are interested only 
in the lower part of the aliased spectrum. This is the use of lags which 
are multiples of Ar = hAt with h > 1. The use of lags up to mAr = hmAt 
allows us to explore the spectrum down to frequencies almost of the 
order 1/hmAt, which, had we used all multiples of Aé up to hm, would 
have required hm + 1 values of C, (or of its modifications) instead of 
m + 1. The price of doing this is the aliasing of the spectrum with fold- 
ing frequency 1/(2Ar) = (1/h) (1/(2At)), which is h times as much 
aliasing as if all multiples of Aé up to hm had been used, yielding a fold- 
ing frequency of 1/(2At). 

If such intensive aliasing is bearable, this procedure with Ar > At is 
simple, even though it is not necessarily economical. Indeed, if so 
much aliasing were permissible, we need only have ‘read’ every hth 
data value. In many situations, however, especially where At has been 
taken as large as aliasing will permit, such further aliasing is unbear- 
able. If we are to look at the low frequency part of the aliased spectrum 
P 4(f)with computational economy, another course will have to be found. 

Our use of linear schemes in prewhitening shows us a possible course. 
Let us begin by applying a linear scheme to the given values X, , which 
attenuates all high frequencies. Then we can face further aliasing, and 
proceed apace. 

If simplicity is controlling, then we take 


Xo=Xet Keates + Xeeu (k terms) 
for which the relation between the spectra (the power transfer function 
of the smoothing) is 








PUD): .- —iwht —i(k—l)wAt 12 
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This will give us zeroes at frequencies which are multiples of 1/kAt, and 
we can avoid folding the first two side lobes of this function onto the 
main lobe and still take a folding frequency as small as 2/kAt. Such a 
choice will fold the second, fifth, sixth, etc. side lobes onto the first side 
lobe, and it will fold the third, fourth, seventh, eighth, etc. side lobes 
onto the main lobe. We obtain such a folding frequency by retaining 
only one in every k/4 of the X,’s. These decimated* X,’s may, in par- 
ticular, be obtained by summing the X,’s in non-overlapping blocks of 
k/4, and then summing these block sums in all possible (overlapping) 
sets of four successive blocks. (This requires (k + 8)/k additions per 
original value.) The estimated spectrum below 1/kAt has to be multi- 
plied by 


and only aliases which are usually negligible will have been superposed 
on the principal aliases. About one kth of the original principal spectrum 
will be available for analysis. 

The stability obtained by this process can be easily compared with 
that obtained by using all X, and taking Ar = kAt/4. In each case, the 
width of the elementary frequency bands is approximately 1/27", 
where 7", has slightly different, but not substantially different values. 
The process just described yields nearly the same stability as Ar = kAt/4, 
and usually involves much less computation, besides avoiding serious 
aliasing. It will almost always be preferred to using Ar = hAé withh > 1. 

Other schemes of smoothing by groups are discussed in Section B.17. 


18. PILOT ESTIMATION 


The prewhitening procedure demands a rough knowledge of the spec- 
trum for its effective use. Sometimes this rough knowledge can be ob- 
tained from theoretical considerations, or from past experience, but in 
many cases it must be obtained from a preliminary (pilot) analysis of 
the data. Such pilot analyses should be as simple and cheap as possible. 
We now discuss a pilot analysis giving very rough results quite easily. 

Table III exemplifies a form of calculation which is easily carried out 
either entirely by hand, or with a desk calculator. The symbols “6” and 
“g” refer to differences and sums of consecutive numbers in non-over- 
lapping pairs. Taking the numbers in non-overlapping pairs is not neces- 

* Although this word should refer strictly to the deletion of only every 10th 


item, we shall apply it to the retention of only every jth item, for whatever 7 may 
be relevant. 
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Tasie III — Computation or Pitot ESTIMATES 











q Xq 5Xq | (6X9)? | oXg b0Xq | (50Xq)?) oXq b027Xq | (d0?Xq)? 

1 3 

2 4 1 1 7 

3 —1 

4 —2 —l 1 —3 —10 | 100 4 

5 2: | 

§ 7 5 25 9 

7 5 

8 —1 —6 36 4 —5 25 13 9 81 

9 —3 

10 2 5 25 —1 

11 5 

12 4 —1 1 9 10 100 8 

13 7 

14 3 +4 16 10 

15 4 

16 —1 —5 25 3 —7 49 13 5 25 

17 —4 

18 2 6 36 —2 

19 4 

20 0 —4 16 4 6 36 2 

21 1 

22 -—1 —2 4 0 

23 1 

24 2 1 1] 3 3 9 3 1 I 

25 4 

26 3 —1 1 7 

27 0 

28 —4 —4 16 | -4] —11 121 3 

29 —] 

30 —2 | -1 1 —3 

3l —2 

32* —2 0 0 —4 —1 1 —7 —10 ; 100 
Totals 205 441 207 








CONTINUATION OF TABLE III To THE RicgHt (CoMpPREsSSED) 

















q Xq 603X q (603X g)? aXq b01Xg (504X q)? oXq (o5X q)? 
8 17 
16 21 4 16 38 
24 5 
32 —4 —9 81 1 —37 13869 + 39 1521 
97 1369 1521 











(* Note: 82 = 25.) 


sary, but saves much calculation at little cost in accuracy. (In this table 
sums and differences are entered in the lower of the two lines to which 
they correspond.) 

The final sums of squares are roughly proportional to the power in 
successive octaves coming down from the folding frequency. They differ 
by only a constant factor, equal to the number 2° of values X, used, 
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out the very lowest frequencies. This fact allowed us, in dealing with 
continuous records, to treat the “signals”? being processed as if they had 
zero means. In dealing digitally with equally spaced data, all frequencies 
down to zero are transmitted, unless we take special precautions. Conse- 
quently, we must give serious attention to the very lowest frequencies. 

(We must now distinguish between power (in the sense of a line) at 
zero frequency and power density at zero frequency. The power spectrum 
of a stationary random process with zero means may have finite power 
density at zero frequency without having finite power there. However, 
finite power at zero frequency may be introduced into the data in meas- 
urement. It would then be desirable to filter out the power at (exactly) 
zero frequency without affecting the power density at and ncar zero 
frequency due to the stationary random process, but this cannot be done 
perfectly.) ; 

The need for such attention becomes clear when we consider the effect 
of ‘‘small” displacements of the average. Suppose that most of the ob- 
servations (say about 999 in 1000) lie between —100 to +100, with a 
few falling outside one limit or the other. This would be the case when 
the standard deviation is about 30, the variance about 900. If the average 
of the observations were 5 or even 10, we might or might not detect at a 
glance its failure to be zero. 

The total power is the square of the average (de power) plus the vari- 
ance. Numerically, perhaps 25 + 900 = 925 or 100 + 900 = 1000. All 
the de power belongs to the very lowest frequency band, whose width is 

J 
Af = oT" 
If we have data at one second intervals for a period of 15 minutes, a total 
of 900 points, we will have a folding frequency of one-half cycle per 
second, and 900 elementary frequency bands before we reach the folding 
frequency. Thus up to one tenth of all the power may be concentrated in 
one 900th of the spectrum, so that the lowest frequency band has a power 
density up to 90 times that of the average of the 899 others. It is not 
surprising that precautions need to be taken to deal with such possi- 
bilities. (After all, our standard spectral windows have side lobes more 
than 1 per cent the height of the main lobe.) | 

Slow trends, which may reasonably be regarded as zero-frequency sine 
waves, just as constant displacements are regarded as zero frequency 
cosine waves, are not nearly so likely to involve quite so substantial 
excesses of power density, but instances of this may and do arise. 

Any way of dealing with these effects must essentially remove the 
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lowest elementary frequency band, or both this band and the next to 
lowest one. In the process it will also have to eliminate some parts of the 
next higher elementary bands as well, since we cannot design a filtering 
procedure entirely free of side lobes. Two classes of ways of doing this 
are important. Either the X,’s can be linearly altered, as by subtracting 
the mean of them all from each of them, before the mean lagged products 
are calculated — calculated from modified data as if they were original 
data — or additional computations may be made and combined with 
either the mean lagged products or their cosine series transforms. Thus, 
for example, the mean of all data may be calculated and the square of 
this mean subtracted from each and every mean lagged product. The 
effect of all of these modifications can, however, be summarized as apply- 
ing the finite cosine series transform to 


CG; — Er 


where k identifies a specific method of modification, rather than to the 
C, alone. 
In place of 


ave {[Pia(f)} = Qia(f) *Pa(f), 


we shall now have 


ave {Pias(f)} = Qia(f) *Palf) = [Qia(f) — Ra(f)] + Palf) 


where Rx(f) is related to the /;, in the same way that Q,(f) is related to 
the C,. 

Details for certain special choices for £;, are given in Section B.19. 
It is there concluded that, among others, satisfactory choices for prac- 
tical calculation appear, for the present, to be, for removing possible 
constants, 


Eu, = (XP (independent of 7) 


and, for removing the effects of both possible constants and possible 
linear trends, 


2 
By = +e -4- 2r _ Or) ge — 5) 
16 nv n nN? > 
where X+ and X~- are the means of the right- and left-hand thirds of the 
X values. 
WARNING: It will almost never be wise to fail to use some /;, in a digital 
computation. 
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ANALYSIS IN PRACTICE 


The two sections which follow discuss the questioning and planning 
required whenever a digital analysis of equally spaced data is to be 
made, and exhibit a sample sequence of calculation formulas which 
might result from such planning. They are intended to summarize the 
previous material in its application to analysis. (Application to planning 
for measurement is treated next after this.) 


20. PRACTICAL ANALYSIS OF AN EQUALLY SPACED RECORD 


We may logically and usefully separate the analysis of an equally 
spaced record into four stages — each stage characterized by a question: 

(a) Can the available data provide a meaningful estimated spectrum? 

(b) Can the desires of the engineer for resolution and precision be 
harmonized with what the data can furnish? 

(c) What modifications of the data are desirable or required before 
routine processing? 

(d) How should modification and routine processing be carried out? 
Failure to adequately consider any one question properly, or failure to 
apply any one answer, can make the entire analysis worthless. 

The data presented will have come about by measuring some physical 
phenomenon at regular intervals. Thus, 

1. the spectrum of the phenomenon 

2. the frequency response of the instruments used to make the meas- 
urements 

3. the probable magnitudes of measuring, and recording or reading 
errors, and 

4. the time separation between adjacent values 
are all relevant. 

The first stage of consideration is to inquire generally about these quan- 
tities, and to determine whether either aliasing (see Section 12) or back- 
ground noise is so heavy as to make the values almost wholly useless. © 
Thus, if the spectrum is believed to extend up to 10 megacycles with 
substantial intensity, if the measuring equipment is flat to 1.2 kilocycles © 
and is 60 db down at 5 kilocycles, and if the values are measured every 
sto of a second, we may as well stop here and go no further, since the 
whole available spectrum (up to 100 cycles) will be aliased more than a 
dozen times over. (The 1.2 kilocycle measurement bandwidth, which will 
be aliased 12 layers deep, will control rather than the 10 megacycle 
phenomenon bandwidth.) 

If, on the other hand, the equipment was flat to 10 cycles, down about 
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6 db at 20 cycles, 15 db at 30 cycles, and 60 db at 50 cycles, we would 
not expect any irremovable aliasing difficulties, and would expect to be 
able to estimate the spectrum up to some moderate frequency — up to, 
say, 20 cycles, 30 cycles, or 40 cycles, depending upon how much back- 
ground noise was present. (The energy above 100 cycles would not be 
recorded.) 

In the next stage we should inquire into 

1. the frequency resolution required 

2. the fractional accuracy of estimation required, and 

3. the total duration of data available, and the number of pieces into 
which it falls. 

Items 1 and 3 can be combined and converted into the approximate 
number of elementary frequency bands (number of degrees of freedom 
— see Section 9 which is based on Sections 6 to 8) possibly available for 
each of the proposed estimates. This number can then be compared with 
the number of degrees of freedom required (also see Section 9) to give 
the desired fractional accuracy. If these are consistent, or if the desired 
accuracy, or the desired resolution, or both can be modified to make 
them consistent, then there is a good chance that the data can be per- 
suaded to yield the desired results, and further inquiry is indicated. If 
not, we should stop here. 

Explicit relations among duration, resolution, and fractional ac- 
curacy, the latter expressed in terms of 90 per cent interval (cp. Tables 
I and IJ), are given in Section B.23. These lead to an approximate 90 
per cent spread, expressed in db (decibels), of 


14 
4/ (total duration in secs) (resolution in eps) — 4— 4 (number of pieces) 








a result which may often be conveniently used in such an inquiry. 
At the beginning of the third stage, information should be sought as to 
1. over what range of frequencies the spectrum is desired, and 
2. whether any lines or high and narrow peaks are to be expected, 
and at what frequencies. 
Guided by this information, it should be possible to decide whether either 
a. smoothing by groups (as in Section 17) to reduce computation 
without loss of low-frequency information, or 
b. rejection filtration (as in Section 16) to suppress well-established 
lines or high and narrow peaks, 
or both, are desirable. If desirable, they are then carried out before or 
during the next step. 
Unless advance information about the spectrum is exceedingly good, 
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a pilot analysis (see Section 18) to establish the rough form of the spec- 
trum will now be very much worthwhile. The result (or the very good 
advance information, if available) will now make it possible to choose a 
reasonable prewhitening procedure (or, possibly, to choose not to pre- 
whiten). Once suitable prewhitening (see Sections 11 and 15) has been 
chosen, and either carried out or planned for, the third stage is complete. 

Finally, the information on resolution and accuracy combine to specify 
the width of spectral window desired, and hence (see Section 13) the 
number of lags for which mean lagged products should be calculated. 
When these are in hand, they are modified and transformed (or, perhaps 
more simply, transformed and convolved — see Section 13), adjusted to 
screen out very low frequencies, and the resulting power density esti- 
mates are corrected for the prewhitening, and for grouping and/or re- 
jection filtration (if any) used. The final estimates are best plotted on a 
logarithmic power scale, since their accuracy will be roughly constant 
on this scale. Crude confidence limits can then be calculated from the 
number of degrees of freedom (see Section 9) which would be present 
in the individual estimates if: (i) the process were Gaussian, and (ii) 
the prewhitened spectrum were flat. (The factor of safety of Section 
8 will ordinarily be adequate.) 


21. SAMPLE COMPUTING FORMULAS 


We cannot prescribe one set of computing formulas for general use, 

since there are rational reasons for different choices. All we can do is 
illustrate a procedure which may work fairly well in many cases. (And 
our example is not likely to be the only one with such properties. If the 
reader understands, by comparison with adjacent sections, just why we 
do what we do, he can compare other procedures with this example in a 
meaningful way. He will have to understand much of what is said in 
order to do this.) 
- If X¥,,¢= 0,1, --- , n are the given observations, which. we will treat 
as if at unit spacing, it is likely that P4(f) decreases substantially as f 
goes from 0.0 to 0.5 = fy . (If it does not, then aliasing is likely to have 
been serious, and satisfactory analysis at this spacing may be Deel ) 
Prewhitening by 


x = Xt = 0.6 Xi 
which multiplies P a(f) by 
1.36 — 1.20 cos 2zf, 
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a factor increasing from 0.16 to 2.56, may be a wise prewhitening. (The 
index ¢ will now start at 1, and not at zero.) 
We calculate next 


; 1 Wee. f- 1 n a 2 
C; = t ere ines >» X t}y 
n—-?T 4 n 1 y 
namely mean lagged products with an adjustment for the mean. (I*urther 
adjustment for a linear trend might have been necessary. See Section 
19.) Let us suppose that we do this forr = 0, 1, 2, ---, 24 = m. 


(Some other choice may have been appropriate.) 
Next we calculate the finite cosine series transform 





we 


V; -|a+2% =e -cos +— + C,- cos rx | 


and the results of hanning (see Sections 5 and 18) 
Uy.= (Vo + Vi) 
GO, =Weitevet+iVar., Lsersm— l], 
Cp SV EV nes 


These can then be corrected for both prewhitening and the correction 
for the mean by forming (see Section B.21) 











“ 1. Us: 
n~ ™ 136 — 1.20 cos 
6m 
1 
Us lsrsm-l, 
2rr 
1.36 — 1.20 cos —— 
2m 
1 ; ioe 
1.36 — 1.20 cos (1 = in) Qn 


as smoothed estimates of the power density. Estimates with subscript 0 
will apply in the range just above zero frequency, those with subscript r 
near a frequency of r/(2m) cycle per observation, and those with sub- 
script m in the range just below a frequency of 0.5 cycle per observa- 
tion. 

In interpreting these estimates four cautions are important: 

(a) aliasing of frequencies (see Section 12) may have taken place, 
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(b) the estimates are smoothed with a crudely isosceles triangular 
weighting function (see Sections 5 and 13) of full width 4/(2m), 

(c) no estimate will be more stable than chi-square on (2n)/m degrees 
of freedom and, wherever the spectrum is not smooth, the stability of 
the estimates will be appreciably less (see Section 9), 

(d) adjacent estimates will not have independent sampling errors, 
though those not adjacent are at least very close to being uncorrelated. 

The units involved are such that the smoothed one-sided, aliased power 
density on 0.0 S$ f S 0.5 is approximated by twice the estimates. The 
pieces into which the variance would be divided, each coming from a 
frequency band of width 1/(2m) cycles per observation, are estimated 
by 1/(2m) times the corrected estimates. 


PLANNING FOR MEASUREMENT 


Up to this point, with the exception of part of Section 11, our discus- 
sion has been concerned (i) with what happens when certain operations 
are performed, and hence (ii) with how we should make the best. of 
what we already have. 

The third aspect — planning the measurements or observations to 
meet requirements —has not been adequately treated. (Both statis- 
ticlans and engineers concerned with measurement will agree that this is 
the most vital aspect of all, but will, unfortunately, also have to admit 
that, all too often, ‘‘salvage” work will be required because this third 
aspect was omitted, and the observations made unwisely.) 

In discussing ‘“‘What data shall we take?’’, ‘“How shall we measure it?”’, 
the same considerations will recur as in discussing ‘‘How shall we analyze 
it?’’, but (i) they will be looked at from quite different aspects and (ii) 
they will be even more important. Now, by planning in advance of data- 
gathering, we may be able either to replace useless or difficult-to-analyze 
measurements by usable ones, or to avoid making measurements which 
could never provide the desired information. 

The first basic decision has to do with the type of pOnnDS and analy- 
sis to be used. Three types are in use today: 

(1) Spaced: Analog use of intermittent recorders (photography of 
situations or of dials, etc.) or digital recording at equally spaced inter- 
vals (electronic reading of dials, photography of counters, etc.). 

(2) Mixed: Continuous recording (on film, calibrated paper rolls, etc.) 
with the intention of analyzing equally spaced values to be ‘“‘read’”’ from 
these continuous records. 

(3) Continuous: Continuous recording (FM recording on magnetic 
tape, etc.) with the intention of making an analog analysis. 
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The choice among these types will depend on their particular advan- 
tages and disadvantages, and on the availability of equipment, both for 
recording and analysis. In almost every case, however, the detailed 
problems will be surprisingly similar. 


22. CHOICE OF FREQUENCY RESPONSE 


In each instance there will be a problem of the response of the ob- 
serving and transmitting or recording elements to high frequencies. 
When less quantitative studies are made, it is usual to worry whether 
the high-frequency response is large enough to “follow” the phenomena 
precisely. To be sure, if recording is only at intervals, and the needle is so 
blurred as not to be read, the high-frequency response may indeed be 
reduced by filtering. Such filtering is too likely to be regarded as un- 
fortunate rather than helpful. Effort tends always to be applied for 
“faithful” recording. This is appropriate for recording specific individual 
time histories for visual study, but is often most inappropriate for re- 
cording sample time histories for statistical study with the aid of sensitive 
filters (analog or digital). (When the recording is continuous, be it on film, 
oscillograph paper, or magnetic tape, the “writing” means has a limited 
frequency response, and this will usually help to keep the record from 
blurring.) 

When the analysis is to be made on equally spaced data, whether the 
recording be continuous or equi-spaced, thereisa real problem of aliasing. 
And there is need for a basic choice of a frequency cutoff, usually in terms 
of two frequencies such that (i) the experiment is only concerned with 
frequencies up to the lower one, and (ii) frequencies beyond the upper 
one will not be recorded. The need for such a choice in a continuous 
system may not appear’ to be so acute, since only problems of noise or 
non-linear distortion are involved (see Section 11). Yet in practice, it 
will almost always be made — indirectly — by the choice of a writing 
speed (which implies a frequency cutoff for a continuous recorder). 
Economic pressures to reduce both the volume of record, and the extent 
of measurement and computation, act to lower the frequency cutoff, while 
desires to follow the spectrum to higher frequencies act to raise it. The 
proper choice comes from balancing these pressures. 

Sometimes in mixed systems, when continuously recorded data is to 
be subjected to equi-spaced analysis, an attempt is made to compromise 
matters by recording with a high cutoff, and then asking that the 
measurements of this record be “eye averages’’ over periods long enough 
for the record to show considerable variation. Such compromises do not 
seem to work nearly as well in practice as their proponents suppose. Re- 
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placing the “averages” by the results of “reading to the line” at equi- 
spaced points often seems to give better results, even though a smaller, 
but unknown amount of aliasing is thus replaced by a larger, known 
amount. Putting the filtering into the observing and writing equipment, 
rather than into the (human) measurer and transcriber, will usually do 
even better — better by a large margin. 

If one can be confident of the upper limit, beyond which the power 
spectrum will not be needed, it is usually best to record with a related 
frequency cutoff, thus reducing noise complications, aliasing difficulties, 
and the necessary bulk of the record. 

Conversely, however, points must be recorded or measured fre- 
quently enough (or a high-enough writing speed used) so that aliasing 
(or loss of high-frequency response) is not serious. (I’or a given maximum 
usable frequency, the sharper the cutoff, the less stringent this require- 
ment.) 

To summarize, the problems surrounding aliasing should lead to the 
choice of a frequency cutoff which is usefully described by two frequencies 
(which may reasonably be in the ratio of 1 to 2): 

(a) a lower frequency, which is the highest at which important power 
spectrum estimates will be made, and 

(b) a higher frequency, at and above which no serious amount of re- 
cording is done. 

Both of these need to be chosen before settling finally on observing and 
recording equipment. If equi-spaced data is produced, the folding fre- 
quency may be as low as half-way between these two frequencies. 

A prime essential to keep in mind is that all measurement, transmis- 
sion, and analysis systems are essentially band-limited. It is always in- 
advisable to try to cover too many octaves of log frequency while using 
exactly the same techniques. 


23. DURATION OF DATA REQUIRED 


Instead of trying to compromise resolution and stability within the 
limitations of available data, we may now consider the costs and ad- 
vantages of getting still more data, or, perhaps, somewhat less data. 
We face a three-way compromise among effort, resolution, and stability 
(precision) of estimate. 

Effort has to be measured in various ways, but the duration of initial 
record will almost certainly have to be considered as one measure. It is 
shown in Section B.23, where both precise definitions of the quantities, 
and a corresponding formula for the necessary numbers of pieces of a 
given length will also be found, that 
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oe 200 (pieces) 
(90% range in db)? 3 
(resolution in cps) 





Dole 


(total duration in seconds) = 





If, for example, a resolution of 0.1 eps is to be obtained from 6 pieces of 
record and is to furnish stability of +2 db for (on the average) 75 of 
the individual estimates, then the necessary duration will be 


ages eI 150 seconds. 


This applies equally to analog processing of continuous records or to 
digital processing of spaced records, so long as we apply the best methods 
which we know to a shape of spectrum which is not exceptionally diffi- 
cult to handle. 


24, AMOUNT OF DIGITAL DATA-HANDLING REQUIRED 


If spaced data are to be digitally processed, both the number of data 
points to be used and the number of multiplications involved are of 
interest. 

If we can easily build in the desirable frequency cutoff, and have to 
resolve a number of equally spaced bands spaced evenly from zero fre- 
quency to some maximum frequency, then we will require about 

[3 600 


5 + (00% range in db + (pices) | (number of bands resolved) 


data points and, roughly about 


(3 + G09; ange UBF +3 (pieces) ) (number of bands resolved)” 
multiplications. 

These last two results often give only preliminary indications. Aliasing 
difficulties will increase these numbers. The possibility of smoothing by 
groups will decrease them. Details and possible modifications of the 
proposed system of data gathering and analysis need to be studied care- 
fully before final estimates of the number of data points and the rough 
number of multiplications are finally settled upon. 


25. QUALITY OF MEASUREMENT AND HANDLING 


In every case, careful consideration should be given to the quality of 
measurement and data handling required (in terms of the dangers of, 
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e.g.: time-varying frequency response, introduced noise, intermodulation 
distortion, etc.). An extensive catalog would be out of place here, since 
the problems are basically those of instrumentation engineering. But a 
few reminders may indicate the diversity of problems which might arise. 

A camera may be “clamped” to some object to record the relative 
orientation of that object and something visible to the camera. The 
mounting of the camera is never perfectly rigid, and vibrations will 
occur ordinarily at frequencies far above the data-taking rate. Whatever 
the frequency, these vibrations will introduce “noise” into the record. 
At least an order-of-magnitude calculation of the effects of likely vibra- 
tion is needed. . 

Storage of a signal on magnetic tape will be a part of many measure- 
ment-analysis systems. Because only rough spectra are wanted, AM 
(amplitude modulation) recording may be planned. If the fact that AM 
recording and playback is subject to considerable fluctuation in over-all 
gain (db’s, not tenths db) is neglected, measurement planning may be 
quite misleading. 

In a complex analysis, where several spectra and cross-spectra (whose 
analysis we have not specifically discussed) are involved, it might be 
planned to plot the estimates of each spectrum and cross-spectrum 
against frequency, draw smooth curves, and compute derived quantities 
from values read from these curves. Such a process has led to great 
difficulties in certain actual situations, because of the ‘‘noise’”’ introduced 
by such visual smoothing which appears to have distinctive but unknown 
properties. Such a graphical step may appear to be good engineering, 
but it cannot be high quality data handling. Its use may nullify the 
careful selection of other data processes, some of which are delicately 
balanced. 

Graphical analysis should ordinarily be reserved for: 

(a) display of whatever spectrum or function of spectra is really a 
final output, 

(b) description of the actual effects of computational procedures, and 

(c) trouble-shooting. 


26. EXAMPLE A 


Suppose first that the spectrum of some aspect of the angular tracking 
performance of a new radar is to be obtained; that angular tracking can 
only be studied by photographing the target with a camera clamped to 
the antenna; that frequencies near 0.27 cps are of special interest; that 
the spectrum of tracking performance at higher frequencies is relatively 
flat up to 10 eps and then falls rapidly enough to be negligible beyond 40 
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cps; that estimates at all frequencies up to 25 eps are desired; and that 
stability to +1 db is derived. What are the requirements? 

The total amount of tracking required is fixed by the resolution re- 
quirement near 0.27 cps, which we may suppose to be either 0.05 eps or 
0.02 cps. These lead, respectively to durations of 


( + 50 + ”) ee > 1000 seconds 


0.05 
and 
(3 + 50 + 2) 14 9500 eneonids 
2 3/ 0.02 ; 


Single stretches of either 16 or 40 minutes continuous tracking are al- 
most certain to be out of the question. The length of piece available 
would depend on the aspect of tracking performance studied, but a fair 
figure for this illustration might be 200 seconds. Going to Section B.23 
for the necessary formula, we find 


1 
~ + 50 
(number of pieces) = —_*___, = a ety 
(200)(0.05) — 3 : 
or 
1 
| - + 50 
number of pieces) = ———=———_ = —= = 
(number of pieces) = . as 13.7 
(200)(0.02) — ° 


From a purely experimental point of view, these amounts of data are 
moderately hard to substantially hard to obtain, but we may suppose 
them available as far as radar and target availability are concerned. 

We come next to data taking and availability problems. We must 
study the spectrum up to 25 cps. Since the spectrum is negligible only 
above 40 eps, our folding frequency must be at least 32.5 cps, which 
would fold 40 cps exactly back to 25 eps. Hence we need at least 65 
frames a second. Consideration of available frame rates bring us to 64 
frames a second as probably reasonable. This is 12,800 frames in each 200 
second piece, a total film reading load of between 50 and 150 thousand 
frames. This will require some hundreds of man-days of film reading, 
but may perhaps be faced. 

To calculate directly the rough number of multiplications involved, we 
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may begin by assuming that we are going to require the 0.05 or 0.02 cps 
resolution all the way from 0 to 25 eps. Were this the case, then we would 
require to resolve from 





25 
0.05 — au 
to 
25 
ie = 1,250 


frequency bands. The corresponding ee of multiplications range 
from 


[4.5 + 450 + 3 (pieces)] (500)” = 120 million 
to’ 
[4.5 + 450 + 3 (pieces)] (1,250)* & 750 million. 


The running time of an IBM 650 calculator on such a problem is about 
10 hours per million multiplications, so that between 


1,200 hours = 30 shift-weeks 
and 
7,500 hours = 188 shift-weeks 


would be required. Clearly these machine times are out of line, and 
attention should be given to ways of reducing this aspect of effort. 

An application of smoothing by groups seems most likely to be effec- 
tive, especially since the high resolution is only wanted near the low fre- 
quency of 0.27 cps. Let us suppose that, in view of the supposed rather 
flat spectrum out to 10 eps, the engineers concerned will be content with 
two spectrum analyses, one with 0.5 cps resolution extending all the way 
to 25 eps, and the other with 0.02 cps resolution extending only to 1 eps. 
What effect will this have on the computational load? 

Notice first that it will have no effect on the radar-and-target operat- 
ing and film-reading loads. These were fixed by the resolution-precision 
requirements, and by the combination of this with the upper limit of the 
actual spectrum affecting the camera. Replanning details ot the analysis 
will save nothing on either of these. 

The broad-frequency low-resolution analysis will resolve about 


25 


05 = 50 bands 


MEASUREMENT OF POWER SPECTRA 247 


and require roughly 
[4.5 -+ 450 + 3(14)]-(50)° = 1.24 million multiplications 


~ (since we shall need 14 pieces to obtain the required precision at a resolu- 
tion of 0.02 eps). This would require about 12.4 hours machine time, a 
quite reasonable amount. 

The preparation of data for the low-frequency high-resolution analysis 
—if we follow the suggestion of Section 17, requires less than 1.5 
additions per original frame, since each datum contributes to four means. 
This is at most 0.2 million additions and can probably be combined with 
the next step so as not to involve substantial machine time. 

The conduct of the low-frequency high-resolution analysis will resolve 
about 

a = 50 bands 
and will require about another 12.4 hours of machine time. 

Thus we have reduced machine time to about 25-30 hours, in pleasant 
contrast with the remaining requirements of some hundreds of hours of 
film reading and 14 test runs of 200 seconds each. The balance is ap- 
proximately restored. 

Our apparently blind use of the multiplications-required formula has 
concealed one important point. Our calculation of the time required for 
the high-frequency low-resolution analysis tacitly assumed that we have 
processed no more of the data than is required to meet the actual resolu- 
tion-precision requirement. 

The loosening of resolution from 0.02 eps to 0.5 eps in this part of the 
analysis has reduced by a factor 25 the amount of data which must be 
processed to meet the +1 db (90 per cent) requirement. Hence the two 
hours machine time is predicated on processing only 34th of the available 
data. If only about zs of the data isto be processed for the high frequency 
analysis, then it will be desirable to take the most typical 8 or 10 sec- 
onds from each piece. The losses due to end effects will be somewhat 
greater, it is true, but the advantages of increased coverage of the effects 
of unplanned variation, consequent on using parts of all 14 runs, far 
outweigh such considerations. 

It would be possible to use only one run for the high-frequency analy- 
sis, a possibility which emphasizes the fact that +3ths of the film reading 
is done to obtain the raw material for averaging, for filtering out high 
frequencies. If the hundreds of man-days of film reading look out of 
line, and zf the line from the radar to the target is known not to change 
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rapidly (with respect to an inertial frame of reference), then we are driven 
to consider whether the ‘‘clamping” of the camera to the antenna could 
modified in such a way as to provide a frequency cutoff between antenna 
position and camera position. What would be desired would be a reliable 
mechanical filter with a cutoff at 1 or 2 eps, and substantial, reproducible 
transmission up to, say, 0.5 cps. If such a mount could be taken down 
from the shelf, then it would suffice to make (a) one 200-second run with 
a stiff mount and 64 frames per second, and, say, (b) thirteen 200-second 
runs with a mount of such designed softness, and, say 4 frames per sec- 
ond. The total number of frames for reading would now be 12,800 for 
run (a) and 800 for each run (b), a total of about 23,000 frames. This 
might require about a man-month to read, a saving of several man- 
months. Unfortunately, such a sharply-tuned low-pass mount would not 
be likely to be on the shelf. 


27. EXAMPLE B 


As a second example, suppose a new solid-state device develops a noise 
voltage with a power spectrum roughly proportional to 1/f’ when under 
test under most extreme circumstances — circumstances so extreme that 
its average life is 30 to 50 milliseconds, and suppose that the detailed 
behaviour of this spectrum is believed likely to provide a clue to the 
proper theoretical treatment of some of the properties of this device. 
Suppose further that, while it was believed that the shape of the spec- 
trum of the noise from different examples of this device was the same, 
the voltage levels of different devices were quite different. It might be 
reasonable to ask for spectral measurements to £0.25 db resolving 1 eps 
and covering from 1 eps to 500 cps. Direct measurements are likely to be 
most difficult, for the power between 499 and 500 eps is about 73p7p95th 
the power between 1 and 2 cps, a difference of 51 db in level. Our re- 
cording and processing equipment is not likely to have the dynamic range 
required for direct analysis. 

Clearly we should prewhiten our noise as early in the measurement and 
analysis system as we reasonably can. Fortunately, prewhitening here is 


R 


Fig. 12 — RL voltage divider. 
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operationally simple. A RL voltage divider, as indicated in Fig. 12 will 
introduce an attenuation of voltage, if the load impedance is high, 
amounting to 
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which will be initially constant, and then decrease 6 db per octave, with 
a corner atw, = R/L,f. = R/27L. Asa first step in measuring a spectrum 
out, to say, f = 2R/27rL, at which frequency the prewhitened spectrum 
would be down about 7 db, such a change would be useful. The range of 
frequencies which could be usefully studied would not be appreciably re- 
duced by such a change, even though the low frequency power level 
would be greatly reduced by the prewhitening network, since the low- 
frequency power level would not be seriously reduced below the former 
power level at the corner frequency. If one could have been studied, 
the other can be studied. 


28. EXAMPLE C 


The irregularities in the earth’s rotation have been studied by Brou- 
wer,’ who reduced the available observations (times of occultation and 
meridian passage) by averaging over individual years. He states ‘‘oc- 
cultations so reduced in recent years have been demonstrated to yield 
annual means essentially free from systematic errors if the observations 
are well distributed over the year. ...The 6’s may themselves be the 
accumulations of numerous smaller random changes with average inter- 
vals much smaller than a year. The astronomical evidence throws no 
further light on this, though perhaps something may be gained by an 
analysis of residuals in the moon’s mean longitude taken by hinations.’’”* 
These comments suggest that astronomical data can supply values once 
a year, possibly no more frequently, and may be able to supply values 
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about 13 times a year (once per lunation), certainly no more frequently. 
Let us accept the first possibility as a basis for an example. (This is the 
best example we know of a situation where equally spaced data cannot, 
in principle, be had at a finer spacing.) 

The information most nearly directly supplied by the astronomical 
observations is At, the difference between ephemeris time and mean solar 
time. Brouwer discusses two statistical models for its structure, both of 
which are most easily described in terms of the behavior of the second 
differences of the observations. In the first, the true second differences 
are constant over periods of varying length. In the second model, the 
true second differences are independently and randomly distributed. In 
cither case, observational errors, independent from observation-period 
to observation-period also contribute to the observed A?’s. 

If we were to plan an observational program to decide between these 
hypotheses by spectral analysis we need first to specify the alternative 
spectra. The first model seems never to have been made as precise sta- 
tistically as the second. Brouwer’s fitted curves correspond to constancy 
over periods of from 4 to 15 years. We should like to get a general idea 
of the possible spectra corresponding to this model without making the 
model too specific. Consider first a situation in which, except for the 
effects of second differences of experimental errors, the observations are 
constant in blocks of five, and where the values assigned to different 
blocks are independent. The successive average lagged products (start- 
ing with lag zero) are proportional to 5, 4, 3, 2, 1, 0, 0, 0,... and it fol- 
lows that the power density is proportional to 
4 2 


5 COS 3x f/fy + = cos 4rf/fr. 


1+ * cos tf/fyt ; cos 2rf/fy + 5 


Calculation shows that this is high near zero frequency, falling rapidly 
until, beyond about f/fy = 0.3, it consists of ripples with an average 
height of less than z';th the low frequency peak. If, instead of “constant 
by fives’’, the specific model were “constant by eights” or ‘‘constant by 
tens”, still with independence between blocks, this peaking would be 
more pronounced and confined to still lower frequencies. If the lengths 
of the blocks were to vary at random, according to some distribution, 
still with independence of value, the spectrum would be the correspond- 
ing average of such spectra for fixed block lengths. The spectrum to be 
expected for second differences of annual average observations then should 
consist of a sum of two components: 

(1) a “true” component peaked at low frequencies, falling rapidly by, 
say, f/fy = 0.2 or 0.25, and continuing to f/fy = 1.00 with an average 
height perhaps 1 per cent or 2 per cent of the low-frequency value, 
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(2) an “observational error’ component, corresponding to inde- 
pendent errors in the annual averages, and hence proportional to (1 — 
cos (nf/f)). 

In case the second model should apply, the first component would be 
replaced by one with a flat density. 

lig. 13 shows the shapes of the three possible components. The natural 
way to try to distinguish between the two models by spectral analysis 
is to compare the spectral density in the middle range, say f/fy = 0.25 
to 0.5 with that in a lower range, say below f/fy = 0.25. According to 
Model I, the low-range density should be substantially higher than the 
middle-range density, the latter consisting of the effects of observational 
error (whose strength can be well estimated at the upper end of the spec- 
trum). According to Model IJ, the middle-range density should be 
slightly to somewhat greater than the low-range density, the increment 
representing effects of observational error. 

Without more detailed estimates of the relative sizes of the compo- 
nents, it would be difficult to specify exactly how many observations 
would be required to separate Model I from Model II, but 10 to 20 
degrees of freedom in each of the ranges discussed should be quite help- 
ful. This suggests 100 values of annual second differences, corresponding 
to 102 years of careful astronomy, as likely to be helpful. Since Brouwer 
gives annual values for 131 years, some 129 annual second differences are 
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Fig. 13 — Components for two models of earth-rotation irregularities: (1) ‘“‘true 
irregularity’’ component for first model, (2) ‘‘observational error’? component for 
either model, (3) ‘“‘true irregularity’? component for second model. 
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available for trial, and it may be possible to answer the question without 
waiting for many more years to pass. 

It might well suffice to estimate smoothed densities over octaves such 
as 0.0625 S f/fy S 0.125, 0.125 S f/fy S 0.25, 0.25 S f/fy S 0.5 and 
0.5 Ss f/fy < 1. Thus we might consider using the add-and-subtract 
pilot estimation method for initial exploration. The actual analysis of 


Brouwer’s data is considered further in Section B.28. 
APPENDIX A 


FUNDAMENTAL FOURIER TECHNIQUES 


In this appendix we review briefly certain aspects of Fourier transfor- 
mation. These aspects may be regarded as dealing mainly with diffrac- 
tion by slits, rectangular or graded, and by analogs made up of discrete 
“lines”. Convolution and the so-called Dirac functions are specially 
important as convenient tools. Some parts of the discussion will have no 
direct bearing on the analysis of procedures for power spectrum estima- 
tion, but are intended to familiarize the reader with analytical tools 
which are used frequently throughout the remainder of this paper, and 
which may be used to advantage in many other analyses of a similar 
nature. 


A.l Fourier Transformation 


There are several formulations of Fourier transformation which differ 
according to custom, convenience, or taste. The formulation which we 
will adopt here is the one used by Campbell and Foster. Given a func- 
tion of time, G(t), its Fourier transform is a function of frequency, and is 
given by the formula . 


sir) = [ : G(t)-e™ at isa er: 


Conversely, given a function of frequency, S(f), its Fourier transform is 
a function of time, and is given by the formula 


a = | s(s)-e af Cree 


The term “frequency” is used here, not in the probability or statistical 
sense, but in the sense of sinusoidal or cisoidal functions of time (cos wt, 
sin wt, e*”’). 

Our preference for the Campbell-Foster formulation is based on the 
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following points, arranged approximately in the order of increasing 
weight. 

1. Frequencies are expressed in cycles per second more naturally and 
much more frequently than in radians per second. (In our analysis we 
use w only as an abbreviation of 2zf, and only if it is typographically 
convenient.) . 

2. Except for the sign of the exponent in the kernels, the transforma- 
tion formulae are symmetrical. The assignment of the signs here is the 
conventional one in transmission theory. 

3. In most of the applications to communications problems, the fre- 
quency functions are rational functions of p = tw, with real coefficients. 
Hence, the reformulation of the transformation of S(f) to G(@) as 


— 1 i P pt 
c= 5 [8(2)e a 
is a natural and convenient step in the calculation of the integral by the 
method of residues. 

4. The transformation formulae correspond to the conventional rela- 
tions between the impulse response (response due to a unit impulse ap- 
plied at ¢ = 0) and the transfer function (ratio of steady-state response to 
excitation, for the complex excitation ¢“”’) of a fixed linear transmission 
network. These network functional relations are commonly regarded as 
Laplace transformations rather than Fourier transformations. As a 
matter of fact, however, the circumstances in almost all practical appli- 
cations are such that there is no essential difference between Laplace 
transformations and Jourier transformations. Impulse responses are 
zero for ¢ < 0 and vanish exponentially as > «, and transfer functions 
are analytic on and to the right of the imaginary axis (including the 
point at infinity) in the complex p-plane. On the very rare occasions 
when a communications engineer might be interested in the behavior of a 
network under energetic initial conditions, he has ways of introducing the 
initial conditions without using Laplace transforms (Guillemin”). 

It should be noted that, since G(¢) must be a real function, the real 
part of S(f) must be an even function, and the imaginary part of S(f) 
must be an odd function. The even part of G(¢) and the even (real) part 
of S(f) are cosine-transforms of each other. The odd part of G(¢) and the 
odd (imaginary) part of S(f) are negative sine-transforms of each other. 
’ It should be noted also that if G(t) and S(f) constitute a transform-pair, 
then G(—t) and S(—f) also constitute a transform-pair. Further, S(—f) 
is equal to S*(f), the complex conjugate of S(f). 
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A.2 Some Transform-Pairs 


We will now turn our attention to some transform-pairs which we will 
require directly or indirectly in the analysis of procedures for power spec- 
trum estimation. We will use special symbols for some of these trans- 
form-pairs. For later reference, these transform-pairs will be collected in 
Table IV. 

The first transform-pair, which is easily worked out, involves a sym- 
metrical rectangular time function (box ear of length 27), viz. 
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The corresponding frequency function is 


sin wT’, 


Qf) = 2Tn = 2T,,- dif 2fT 
(The values assigned to Do(t) at the end points |¢t| = 7, are those re- 
sulting from the transformation of Qo(f) to Do(t). Of course the values 
assigned at these two points do not influence the result of the transforma- 
tion of Do(t) to Qo(f)). Except for scale factors, this frequency function 
is the function dif u = sin ru/au which recurs constantly in this subject. 
It is often convenient to regard it as the diffraction pattern (in frequency) 
due to passage through a rectangular slot (in time). The behaviour of 
dif 2f7'» is shown in Fig. 14. 

The second transform-pair, which is almost as readily worked out as 
the first, involves a symmetrical triangular time function, viz. 


se | < 
pt ea ieae bate 


= 0, babea ee 


Dit) = 1 


The corresponding frequency function is 
F 2 
Qf) = Tr (mate) = T,,(dif fT'n)*. 
ee 
Except for scale factors, this frequency function behaves as shown in 
Fig. 14. 

The third transform-pair involves a so-called Dirac function as the 
time function. The Dirac function is not a function in the strict mathe- 
matical sense. It is called a “measure” by L. Schwartz.” For our pur- 
_ poses, it will only be necessary to identify 6(¢ — t)-dé formally with 
dh(t — t) where h(é — t) is Heaviside’s unit-step function, viz., 

h(t — &) = 0, t< by 
=1, t>bh 
and to interpret all integrals as Stieltjes integrals. Hence if the time 
function (to use the term loosely) is 


G@®) = 6 — b) 
then, the corresponding frequency function is 
Sf) = 
It should’ be noted that while 6 — é) is easily formally transformed 
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into a frequency function, the latter is not so readily transformed into 
the original time function. 

The fourth transform-pair involves a symmetrical pair of Dirac func- 
tions as the frequency function. Thus, the time function 


G(é) = cos wot (wo = Qfo) 
corresponds to the frequency function 


SQ) = 316 + fo) + o(f — fol. 


If the reader is disturbed over the fact that we are evidently going to 
base our analysis, at least initially, on the use of Dirac functions, he 
should note that Dirac functions are always paired with functions which 
are used widely and freely in transmission theory although they are not 
realistic in a physical sense. Functions of time, such as cos wol, which 
represent an infinitely long past and future history of activity, are not a 
bit more realistic in a physical sense than are “infinitely sharp” lines in 
the frequency spectrum. Similarly, functions of frequency, such as 
exp(—twty), whose absolute values do not vanish as f—> , are not a bit 
more realistic than impulsive ‘‘functions” of time. Nevertheless, as we 
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will see later on, these unrealistic pairs may be used as convenient bases 
for a wide variety of realistic pairs. They thus serve a very useful purpose. 

The fifth transform-pair involves a finite Dirac comb as the time func- 
tion, viz. 


q=m— 


At ; 
Vill; At) = 5 a(t-+mat) + at Dy a(t — gas) + Salt — mai). 
q=m+1 
This is clearly a discrete approximation to D,(t) for Tm = m-At. The 
corresponding frequency function, which is easily worked out with the 
help of the third transform-pair (summing the exponential terms before 
introducing trigonometric equivalents), is 


wAt 


dif 2f(m- At) 
5 asia Lo 


sin mwAt = 2(m-At) cos (rf: At) dif f-At 


Qo(f ; At) = At cot 
Except for a scale factor, the initial behaviour of this frequency function 
is illustrated in Fig. 9. Clearly, since cos 0 = dif 0 = 1, the limit of 
Qo(f; At), when At — 0 with m-At = T,, held constant, is Qo(f). This 
corresponds to the formal convergence of V,,(é; At) to Do(t). 

We have defined this finite Dirac comb with a half-sized Dirac func- 
tion at each end because the corresponding frequency function has 
smaller side lobes, relative to the main lobe, than for the finite Dirac 
comb with a whole Dirac function at each end. This is easily seen from 
the fact that the effect of adding a further half-sized Dirac function at 
each end of V(t; At) is to add At-cos mwAt to Qo(f; Al). 

The frequency function Qo(f; At) is periodic, with a period of 1/At cps. 
It is symmetrical about every integral multiple of 1/(2At) cps. Thus, it 
has an absolutely maximum value of 2m- Aé at the integral multiples of 
1/At cps. It is zero at the integral multiples of 1/(2mAt) eps which are 
not integral multiples of 1/At eps. For large values of m and small values 
of wAt, it behaves approximately like Qo(f). 

The sixth transform-pair involves an infinite Dirac comb in time, and, 
as it turns out, also an infinite Dirac comb in frequency. The time func- 
tion is the formal limit of Vn(t; At) as m — ©, namely, 


q=0 


V(t; At) = At» >) a(t — qAd). 


g=—0 


The corresponding frequency function is 


A(i a) = = (s-4) = av (52). 
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This may be surmised from the fact that 


1/(2A4¢) 
| Qo(f; At) df = 1 for any m 


—1/(2At) 


while 


lim I Qo(f; dt) df = lim Si(2nmeAt), (whore Sia) = | DY ay) 
M>0 V—e€ mo 0 

= 1 for any ¢ in 0 <e < 5a 
The result may indeed be obtained by applying the fourth transform- 


pair with T,,, = m-At to the formal Fourier series representation of the 
infinite comb 


V(t; At) =1+2 py cos one 


Since 
Vin(t; At) = Do(t)- VC; At) 


we also have, as we shall see in the next section, 


Qo(f; At) = Qo(f) * A (4 i 


= ¥ a(r- g). 


g=—0 


The seventh transform-pair arises from the sixth by dividing by At 
on both sides. 


A.3 Convolution 


If Git) = G.(é)-G.(t), then the Fourier transform of G(t) may be ex- 
pressed in terms of those of Gi(é) and G,(¢) as follows. 


si) = [Gx -G.0-<* at, 


= [ato-| [spc ag] eat 
ei. [ [ a. a| “82(8) db, 


= [sup - 9-80 ae 
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This relation, in which S,(f) and S2(f) are interchangeable, is commonly 
expressed in the symbolic form 


S(f) = Si(f) * So(f). 


The implied operation on Si(f) and S.(f) is called a convolution. In par- 
ticular, S(f) is said to be the convolution of S,(f) with S2(f). 
Similarly, if S(f) = Si(f)-S2(f), then 


cw = [ ” Gilt — d)-Gald) ad 


= Gi(d) * G2(¢). 


Thus, multiplication and convolution constitute an operational trans- 
form-paitr. 

(Convolution is often called by a variety of names such as Superposi- 
tion theorem, Faltungsintegral, Green’s theorem, Duhamel’s theorem, 
Borel’s theorem, and Boltzmann-Hopkinson theorem.) 

It may be noted in the detailed derivation above (putting f = 0), 
that 


[. a-e.00 a = [ sr-s)-a 


where S,*(f) is the complex conjugate of S,(f). This is Parseval’s theorem 
of which a very useful special case is 


[tera = [1s fa 


An example of convolution is supplied by the symmetrical triangular 
time function in the second transform-pair. This time function is the 
convolution of two symmetrical rectangular time functions from the 
first transform-pair, with appropriate scalar adjustments. Another 
example is the infinite Dirac comb V(¢; At), which may be regarded as 
the convolution of the finite Dirac comb V,,(é; At) with the infinite 
Dirac comb A(t; 2mAt), that is 


V(t; At) = Vin(t; At) * A(t; 2mAl). 


As the reader may easily verify, this corresponds to 


uy 1 
A (¥ a = Qilf; Al)-V (1 ai): 


Convolution of time functions occurs in communications systems when- 
ever a signal is transmitted through a fixed linear network. If the input 
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signal is G,(t), and if the impulse response of the network is W(é), then 
the output signal is* 


G@ = [ We-»-G.0) a 


= W(t) *G,(d). 


The so-called linear distortion of the signal due to transmission through 
the network can be (and occasionally is) examined in terms of the effects 
of convolution, but the common practice among circuit engineers is to 
conduct the examination in terms of the corresponding frequency func- 
tions. There are good reasons for this common practice. The most im- 
portant of these reasons are: 

1. The relation between the frequency functions is simpler, viz. 


Sf) = YQ) Sif) 


where Y(f) is the transfer function of the network. 

2. The effects of amplitude distortion of the signal and of phase dis- 
tortion (ot the unmodulated signal) may be examined independently. 
While phase distortion is critical in the transmission of pictures (fac- 
simile), it is relatively unimportant in the transmission of speech or 
music. 

3. The transmission characteristics of fixed linear networks are most 
easily calculated or measured accurately in terms of frequency rather 
than time. 

4. Fixed linear network design techniques based on frequency func- 
tions are today much further developed (simpler, more powerful, and 
more versatile) than those based on time functions. 

Convolution of frequency functions occurs in communications systems 
whenever a carrier wave is amplitude-modulated by a signal. If the input 
signal is Gi(¢), and if the carrier wave is cos wot, then the output signal, 
with suppressed carrier, is 


G(t) = Gi(t) “COS wot 


* Tt may be of some help here to think of \ as ‘‘excitation time’’, and of ¢ as 
“response time’’. In the equivalent formulation 


Git) = [ W(r)-Gilt — 7) dr 


we may think of r = ¢ — 2d as the ‘‘age”’ of input data at response time. 

At this point attention is called to a device which will be used many times to 
simplify analysis, which is to use — © and + as limits of integration, letting the 
integrand take care of the effective range of integration. In this case, if G,(A) =0 
for \ < f , and W(r) =0 for r < 0, the effective range of integration would be 
to<rAX<tor0<7r<t— fo. 
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and the relation among the corresponding frequency functions is 
S(f) = Silf) * § [8(f + fo) + Of — fod] 
= 3 Si(f + fo) po Si(f as fo). 


The convolution of frequency functions corresponding to the amplitude- 
modulation of a carrier wave is so naturally visualized simply as shifting 
the signal spectrum (frequency function) that it is almost never vis- 
ualized in any other way. It should be observed, however, that this 
point of view depends critically upon the two-sided specification of the 
signal spectrum, in amplitude and phase, to give the correct picture of 
the sidebands, whether the amplitude-modulation scheme under con- 
sideration be double-sideband, single sideband, vestigial sideband, or 
two-phase (as in TV chrominance signals). Further, the two-sided specifi- 
cation of the modulated-carrier spectrum is essential for a correct picture 
of the demodulation process used to recover the signal. 

For present purposes we will be interested in convolution not only as 
a tool for the synthesis of new transform-pairs but also as an analytical 
tool. For example, by regarding a time function G(é) as the product of 
two other time functions Gi(t) and G2(é) we can make use of the re- 
lation S(f) = Si(f) * S.(f) to reach insights about S(f) which do not 
come easily from the explicit form of S(f). 

To make convolution a useful analytical tool, we have to visualize it 
in some convenient way. This may be done in three ways. The relative 
merits of these three points of view depend upon the circumstances in 
any particular case. | 

In the first place, convolution may be visualized as a stretching process. 
For example, in the equation 


aw = [ ; Gilt — d)-Go(d) ad 


we visualize G2(A)-dd as a rectangular element of G2(¢), originally con- 
centrated at ¢ = \. This rectangular element is then stretched into the 
area under the elementary curve G,(¢ — d)-G2(A)-dA regarded as a func- 
tion of ¢. This elementary curve has the shape of Gi(¢) with origin shifted 
to t = X. The total effect at any particular value of ¢ is then obtained 
by integration over X. In this example, we have regarded G,(¢) as the 
“stretcher” operating on each element of Go(é). Of course, since convolu- 
tion is commutative, we may interchange the roles of the two functions. 

In the second place, if one of the functions in the convolution consists 
exclusively of Dirac functions, each Dirac function may be regarded as 
a “shifter” operating on the other function in the convolution. For ex- 
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ample, 
a — a) *G = [| a -a—)-40) a = GU - a). 


In the third place, convolution may be visualized as a weighted inte- 
gration with a moving weight function. For example, in the equation. 


aw = [ : Gilt — d)-Gald) ar 


we regard G(t) as the integral of G2(A) with weight function Gi(t — X). 
The position of the weight function with respect to the \ scale depends 
upon the value of ¢. In the event that the weight function has unit area, 
G(t) may be regarded as the moving weighted average of G2(A). (As 
previously noted, the roles of the two functions may be interchanged.) 

As an example of the use of the ideas described above, let us assume 
that we have a function G)(t) which is zero outside of the interval 
0 < t < T, and for which the frequency function is So(f). Let us gen- 
erate a periodic function G(t) by convolving Go(f) with A(t; 7). Then, 
since 


G(t) = G(t) « AC; T) 
the frequency function corresponding to G(t) is, from Item 7 of Table IV, 
s(t) = siis)-v (FF). 
ae 
As we expect, S(f) consists of “lines” (of infinite height but finite area) 
at uniform intervals of 1/7’ cps. The complex intensities (areas) of these 


lines represent the amplitudes and relative phases of the terms in the 
conventional Fourier series representation of G(t). Thus, 


G(t) 


[sce a 


1 3) (2) (12eqt)/T 
— So —1°é Mot : 
T q=—-* T 


As a second example, which is in a sense the dual of the first, let us 
assume that we have a function G)(¢) for which the frequency function 
So(f) is zero outside of the band —fy < f < fy . Let us generate a discrete 
time series G(t) by sampling G(t) at uniform intervals of 1/(2fo) seconds. 
If we regard sampling as a multiplication by (or as amplitude-modula- 


MEASUREMENT OF POWER SPECTRA 263 


tion of) an infinite Dirac comb, then 
1 
G(t) = : ; an}. 
(t) = Go(t)-A (: 5 +) 
Hence, the frequency function corresponding to G(é) is 


SY) = So(f) * VF; 2fo), 


or, explicitly, 
S(f) = 2for De Sof — 24/0). 


If this frequency function is multiplied by a frequency function S,(f), 
where 


Si(f) mr lf| < fo 


0, If] > fo 


I 


it will revert to So(f). Thus, 
Si(f) Sf) = Soff). 
Hence, if Gi(¢) is the time function corresponding to S:(f), namely, 


SIN wot 
wot 





Gilt) = 


? 


then 


Thus, sampling Go(t) to get the discrete time series G(¢), and convolving 
G(t) with Gi(¢), restores Go(é) exactly. This result reflects the well-known 
sampling theorem in information theory. The effect of sampling Go(é) at 
uniform intervals of other than 1/(2f) seconds is readily visualized. 


A.4 Windows 


If a time function is even (and of course real), the corresponding fre- 
quency function is real (and of course even), and conversely. These cir- 
cumstances will prevail when we deal with autocovariance functions, 
power spectra, and appropriate weight functions. Under these circum- 
stances, the weight functions will be called windows. Such windows will 
be considered in transform-pairs, and the members of any pair will be 
distinguished as the lag window, and the spectral window. 
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Time windows convolved with periodic functions of time have been 
used by Guillemin,” under the name “scanning functions”, to examine 
the behavior of weighted partial sums of Fourier series. We use them in 
Sections B.4 and B.10 where we call them data windows, and their 
Fourier transforms (which may be complex) frequency windows. 


A.5 Realistic Pairs from Unrealistic Pairs 


Transform-pairs which involve Dirac functions are very easily con- 
verted into a wide variety of realistic pairs. As an example, let us con- 
sider the sixth pair (infinite Dirac combs) which requires two convolu- 
tions for conversion to a realistic pair. If we convolve the time functions 
of the first and sixth pairs, taking 7, <«< At, we get a time function 
which represents an infinite train of narrow rectangular pulses of unit 
height. The corresponding frequency function still consists of Dirac 
functions but these now do not have a uniform intensity. If we next 
multiply the time function of this pair by the time function of the first 
pair, taking 7',, >> At, we get a time function which represents a long but 
finite train of narrow rectangular pulses. The corresponding frequency 
function is continuous and consists chiefly of very narrow peaks of finite 
height approaching zero as f— ©. 

A sinusoidal carrier wave of finite though great length may be repre- 
sented as the product of the time functions of the first and fourth pairs 
with 7'm >> 1/fo. The corresponding frequency function is continuous 
and consists of very narrow peaks at +f , with much lower subsidiary 
peaks of height approaching zero as f > ~. 

If the time function of the third pair is convolved with the time func- 
tion 


Gt) = 0 t<0 
= Ror i>0O, 


the resultant frequency function is 


_ 1 ~ into 
BY) Tear 
of which the absolute value falls off asymptotically like 1/f asf o, 
however small 7(>0) might be. 

In line with this discussion, it should be noted that a realistic “white 
noise” spectrum must be effectively band-limited by an asymptotic fall- 
off at least as fast as 1/f’. Under certain circumstances, however, we may 
assume that the spectrum is flat to any frequency. Let us suppose that 


bo 
o> 
or 
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the spectrum is in fact 
2 


“ee Pn 
P(f) = = (y” [pow 


Q- E ° 2 
where o is the variance. The autocovariance is 
2 = 
C(r) = oe, 


If we transmit this noise through a network with an effective cutoff fre- 
quency well below f. , we may assume for an approximation that 


and, therefore, that 


C(r) & s 6(r) 


although such an assumption is unrealistic if carried to indefinitely high 
frequencies (the input noise would have infinite variance). Hence, if the 
impulse response of the network is W(é), the autocovariance of the out- 
put noise is 


Coutts SF t;) 


ave 1a W (71) X(t — 71) dry 


: ie W (re) - X(t; — 72) ins 


_ I W (ry) W (72) Ch — t; ee bd + re)dr1 dre 


2 oe) 
aw — W (r1) ‘Wr — 1; + t;) dr. 
In particular, the variance of the output noise is 
a } 
Cou) Sf IW OdE dr 
Whe — 
which by Parseval’s theorem is equivalent to 
2 (2) 
Cul wf YO) Pa 
the — 0 


where Y(f) is the transfer function of the network. These results are 
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realistic. (The variance of the output noise is finite and approximately 
correct). 


A.6 Some T'rigonometric Identities 


In this section we develop some trigonometric identities which will be 
needed later on. We start with the equation | 


> cos (Wy + 2hu) = sin (a “Fb Lu @ +b + iu 


sin u 


cos [y + (6 — a)ul 
which is easily obtained by substituting 
ag ae e 

2 


in the left-hand member, summing the exponential terms and making 
some elementary trigonometric substitutions. By substituting y + 7/2 
for y we then get 


Do sin (+ hu) = 


Now, setting wu = af, and using the function introduced in Section A.2, 


sin pu _ (sin paf)/pxf _ dif pf 


» sin u (sin af )/af dif f ’ 
which, on differentiation, yields 


d (dif pf) _ (a HY ( dif’ pf _ dif’ f). 


cos xX = 


sina +b + Du. 


sin u 





sin [y + (6 — a)ul. 

















df (dif f) diff) \ dif pf diff 

Before we rewrite our summation formulas in terms of such ratios of 
“dif”? functions, we need to appreciate their behavior. For p not very 
small, (dif pf)/(dif f) behaves much like the numerator for pf small 
and moderate. The effect of the denominator is to force symmetry around 
integer multiples of $, so that the peak at f = 0 is repeated at f = 1, 2, 
3, -:+ , thus making its behavior consistent with aliasing. For 0 S f S 3 
its other effects are minor, since in this range (2/7) S dif f S 1, while 
the extrema of dif pf have shrunk from +1 to +2/(p7). For most con- 
siderations, therefore, we can approximate this ratio by the numerator. 

We now rewrite our summations as means, introducing (dif pf)/(dif f), 
nae 


dif (a +b + 1)f 
dif f : 

dif (a+ b+ Sf 
diff 


> cos (y + 2hrf) = os [Wy + (6 — a)zf] 


orori? 


> sin (Wy + 2haf) = in [fy + (b — a)zfl. 


am? 
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Differentiating with respect to f, and multiplying through by 
a be 1/20), 
we get 


dif (a+b+ of) 


dX Asin (¥ + 2haf) = ( aif 


| 
_ (tes 1)’ dif’ @+b+ If 
oe Case Bay 


eae ree d wt) cose G oa | 





7 tb+1) sin ly + 6 — af] 








with a similar formula for 


ye h cos (by + 2hzf). 


We shall now use these formulas to obtain results about the average 
values of certain quadratic functions of chance variables Xo, X1, ---, 
X,. The average value of any such quadratic function can be repre- 
sented in terms of a corresponding spectral window Q(f) in the form 


[ en)-2P~) af 


whenever 
etx xia i cos 2ngf-2P(f) df 
0 


for all suitable integers ¢ and gq, since the quadratic function can be 

expressed as a sum of multiples of terms of the form X,X,4, . To deter- 

mine the height, Q(fo), of the spectral window corresponding to a specific 

quadratic function, it suffices to consider the special case 2P(f) = 

6(f — fo), for which ave {X.X1.4,} = cos 27qfo , when the average value 

of the quadratic function for such a special set of X; is exactly Q(fo). 
If ave {X:X14,} = cos 2mgqf, we easily find that 


ave d(; ri k*) (peri *)} 


1 1 
= gE Ew 2atto = 0 
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is yy dif le +d + DF cos (—onfh + (d — eaf) 


= 3572 dif f 
_dif@+to+lfdifc+d+ lf ee 
oe ge (d—c—a+ )d)af 


~ dif (a+b + Df dif (e+ d+ Df cos d—e—at d)af 


any of these expressions being the spectral window corresponding to 


(Es) (Gi E*). 


Making the same assumption, we. find that (where n = 2¢ + 1) 


w= (Em) (Sex) 


— 2d ya gh cos 2xf(g — h) 


(i “t) (= dif’ nf _ n dif’ f 
dif f J \Qr Or 








a dine De ait i aay 
(a a) (: dif’ nf — 1 dif’ —h) 
4 \ dif f a difnf nm diff 


These expressions therefore represent the spectral windows correspond- 


ing to 
4 2 
(> Xs) 
—t 
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GLOSSARY OF TERMS 


Add-and-subtract method 


A method of roughly estimating spectra based on successive additions 
by non-overlapping two’s followed by a differencing. (18, B.18 and 
B.28.) 


Alias 


In equally spaced data, two frequencies are aliases of one another if 
sinusoids of the corresponding frequencies cannot be distinguished by 
their equally spaced values (this occurs when f, = 2kfy + fe for integer 
k); the principal aliases lie in the interval —fy S f S fy. (See also 14.) 
(Also alzased, aliasing, etc.) 


Aliased spectrum 


See Spectrum, aliased. 


Analysis, pilot 


Any of a number of methods of obtaining a rough spectrum, including 
the add-and-subtract method (18, etc.) the cascade method (B.18), the 
complete add-and-subtract method (B.18). 


Autocorrelation function 


The normalized autocovariance function (normalized so that its value 
for lag zero is unity). 


A utocovariance function 


The covariance between X(t) and X(t + 7) asa function of the lag r. 
If averages of X(t) and X(t + 7) are zero, it is equal to the average value 
of X(t)-X(¢ + 7). It can be defined for a whole ensemble, a whole func- 
tion stretching from — © to +, or for a finite piece of a function; in 
the latter case it is called the apparent autocovariance function (see 4). 
Certain related functions are called modified apparent autocovariance 
functions (also see 4). 


Autoregressive series 


A time series generated from another time series as the solution of a 
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linear difference equation. (Usually where previous values of output se- 
ries enter into determination of current valuc.) 


Average 


The arithmetic mean, usually over an ensemble, a population, or some 
reasonable facsimile thereof. 


Band-limited function 


Strictly, a function whose Fourier transform vanishes outside some 
finite interval (and hence is an entire function of exponential type); 
practically, a function whose Fourier transform is very small outside 
some finite interval. 


Box-car function 


A function zero except over a finite interval, in the interior of which it 
takes a constant value (often +1). 


Cardinal theorem (of interpolation theory) 


A precise statement of the conditions under which values given at a 
doubly infinite set of equally spaced points can be interpolated (with the 
aid of the function (sin (« — 2;))/(a — 2;) to yield a continuous band- 
limited function. (See B.1.) 


Cascade process (of spectral estimation) 


A process of spectral estimation in which a single step is repeated 
again and again, each step yielding both certain estimates and a con- 
densed set of data (ready for input to the next step). (See B.18.) 


Chi-square 


A quantity distributed (strictly exactly, but practically approxi- 
mately) as xy" + a -+ +++ + a where 2, %2, --- , 2% are independent 
and Gaussian, and have average zero and variance unity. 


Continuous power spectrum 


A power spectrum representable by the indefinite integral of a suit- 
able (spectral density) function. (All power spectra of physical systems 
are continuous.) 
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Convolution 


The operation on one side of a Fourier transformation corresponding 
to multiplication on the other side. (See A.3 for detailed discussion.) 


Cosine transform 


A series (see 13) or integral (see 2) transform. in which a cosine of 
the product of the variables is the kernel. 


Covariance 


A measure of (linear) common variation between two quantities, equal 
to the average product of deviations from averages. (See 1.) 


Cross-speclrum 


The expression of the mutual frequency properties of two series analo- 
gous to the spectrum of a single series. (Because mutual relations at a 
single frequency can be in phase, in quadrature, or in any mixture of 
these, either a single complex-valued cross-spectrum or a pair of real- 
valued cross-specira are required.) (Also cross-spectral.) 


Data 


As specifically used in this paper, values given at equally spaced 
intervals of time (often called time series). 


~ Data window 


A time function which vanishes outside a given interval and which is 
regarded as multiplying data or signals defined for a more extended 
period. (Data windows are usually smooth (graded) to improve the qual- 
ity of later frequency analysis.) 


Degrees of freedom 


As applied to chi-square distributions arising from quadratic forms in 
Gaussian (normal) variables, the number of linearly independent squared 
terms of equal size into which the form can be divided. In general, a 
measure of stability equal to twice the square of the average divided by 
the variance. 


Delta-component 


A finite contribution to the spectrum at one frequency (B.10 only). 
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Diffraction function 


_ sin 7a 
Te 


dif x 


~ Dirac comb 

An array of equally spaced Dirac functions, usually most of which 
are of equal height. - 
Dirac function 

The limit of functions of unit integral concentrated in smaller and 
smaller intervals near zero. (See A.2 for fuller discussion.) 
Distortion 

Failure of output to match input. (Often specified as to kind of failure 
as linear, amplitude, phase, non-linear, etc., cp. A.3.) 
Effective record length 

Actual length of record available reduced to allow for end effects. 
(See 6.) 
Elementary frequency band 

An interval of frequency conveniently thought of as containing “a 
single degree of freedom”’, equal to the reciprocal of twice the duration 
of observation or record. (Since both sines and cosines may occur, it 
requires two elementary frequency bands to contain “‘an independently 
observable frequency.’’) 
Ensemble 

A family of functions (here functions of either continuous or equi- 
spaced time) with probabilities assigned to relevant sub-families. 
Equivalent number (of degrees of freedom) 


See second sentence under degrees of freedom. 


Equivalent width 

The extent of a function regarded as a window as expressed by the 
ratio of the square of its integral to the integral of its square. (See 8.) 
Filtered spectrum 


Spectrum of the output from any process which can be regarded as a 
filter. 


MEASUREMENT OF POWER SPECTRA 273 


Folding frequency 

The lowest frequency which “is its own alias’’, that is, is the ltmit of 
both a sequence of frequencies and of the sequence of their aliases, given 
by the reciprocal of twice the time-spacing between values, also called 
Nyquist frequency. 


Fourwer transform 


Operations making functions out of functions by integration against a 
kernel of the form exponential function of ~/ —1 times frequency times 
time. Often, including here, defined differently for transforming time 
functions into frequency functions than for transforming frequency 
functions into time functions. (See A.1 for details.) 


Frequency 


A measure of rate of repetition; unless otherwise specified, the num- 
ber of cycles per second. The angular frequency is measured in radians 
per second, and is, consequently, larger by a factor of 27. 


Frequency window 


The Fourier transform of a data window. 


Gaussian 


A single quantity, or a finite number of quantities distributed accord- 
ing to a probability density representable as e to the power minus a 
quadratic form. (Also called normal, Maxwellian, etc.) Also, a function 
or ensemble, distributed in such a way that all finite sections are Gaus- 
sian. (See 1.) 


Hamming 

The operation of smoothing with weights 0.23, 0.54 and 0.23. (After 
R. W. Hamming.) 
Hanning 


The operation of smoothing with weights 0.25, 0.50 and 0.25. (After 
Julius von Hann.) 


Hyperdirectwe antenna 


An antenna or antenna system so energized as to have a more compact 
directional pattern than naturally corresponds to its extent (as measured 
in wavelengths). 
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Impulse response 

The time function describing a linear system in terms of the output 
resulting from an input described by a Dirac function. 
Independence (statistical, of estimates) 


In general, two quantities are statistically independent if they possess 
a joint distribution such that (incomplete or complete) knowledge of 
one does not alter the distribution of the other. Estimates are statis- 
tically independent if this property holds for each fixed true situation. 
Independent phases 


An ensemble has independent phases when it can be approximated by 
ensembles consisting of finite sums of (phased) cosines (of fixed fre- 
quencies) whose phases are mutually independent. Continuous spectrum 
and independent phases imply Gaussian character. Every Gaussian 
ensemble has independent phases. 

Intermodulation distortion 

Non-linear distortion, especially as recognized in the output of a 
system when two or more frequencies enter the input simultaneously. 
Joint probability distribution 


Expression of the probability of simultaneous occurrence of values of 
two or more quantities. 


Lag 


A difference in time (epoch) of two events or values considered to- 
gether. 


Lag window 

A function of lag, vanishing outside a finite interval, and either mul- 
tiplying or regarded as multiplying the quantities of a family of quantities 
with differing lags. 
Lagged product 

The product of two values corresponding to different times. (In a 
mean lagged product the lags are usually all the same.) 
Lead 

The negative of lag. 
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Line (in a power spectrum) 

Theoretically, and as used in this paper, a finite contribution asso- 
clated with a single frequency. Physically, not used here, a finite con- 
tribution associated with a very narrow spectral region. 

Lobe 


A bulge, positive or negative, especially in a spectral window. (In most 
spectral windows, a large central mazn lobe is surrounded on both sides 
by smaller side lobes.) 

Mean lagged product 
The (arithmetic) mean of products of equally lagged quantities. 


Moving linear combination 

A transformation expressing the values of an output time series as 
linear combinations of values of the input series in specified relations of 
lag (or lead). 

Negative frequencies 

When sines and cosines are jointly represented by two imaginary ex- 
ponentials, one has a positive frequency and the other a negative fre- 
quency. (Not specifiable for a single time function in real terms.) 
Network (linear) 

In this account, an otherwise unspecified physical device which con- 
verts an input function (of continuous time) linearly into an output func- 
tion (of continuous time). 

Noise 


In general, an undesired time-function, or component of a function. 


Non-normality 


Failure to follow a normal or Gaussian distribution. 


Normality 


The property of following a normal or Gaussian distribution. 


Nyquist frequency 


The lowest frequency coinciding with one of its own aliases, the re- 
ciprocal of twice the time interval between values (same as folding fre- 
quency). 
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Octave 


An interval of frequencies, the highest of which is double the lowest. 


Pilot (analysis or estimation) 

A process yielding rough estimates of spectral density intended mainly 
as a basis for planning more complete and precise analyses. 
Population 

A collection of objects (in particular, of numbers or of functions), with 
probabilities attached to relevant subcollections. 
Power transfer function 

The function expressing the ratio of output power near a given fre- 
quency to the input power near that frequency. 
Power-variance spectrum 


A function of frequency, in terms of which the variances and covari- 
ances of a family of spectral estimates can be expressed in standard 
form. (See 6 and 14 for details in the continuous and equi-spaced 
cases, respectively.) , 


Preemphasis 


Emphasis of certain frequencies (in comparison with others), before 
processing, as an aid to the quality of result. 


Prewhitening 


Preemphasis designed to make the spectral density more nearly con- 
stant (the spectrum more nearly flat). 


Principal alias 


An alias falling between zero and plus or minus the folding or Ny- 
quist frequency. 


Process (random or stochastic) 


An ensemble of functions. (Often composed of functions of time re- 
garded as unfolding or developing.) 


Protection ratio 


The ratio of transmission at a desired frequency to the transmission 
at an undesired alias of that frequency. 
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Recording 

Is spaced when originally taken at equal intervals, mixed when taken 
continuously and processed at equal intervals, continuous when taken 
and processed on a continuous basis. 
Resolution 

A measure of the concentration of a spectral estimate expressed in 
frequency units, here taken (for the important cases) as equal to the 
width of the major lobe. (See B.23.) 
Resolved bands (number of) 


The ratio of the Nyquist or folding frequency to the resolution. 


Sampling theorem (of information theory) 

Nyquist’s result that equi-spaced data, with two or more points per cycle 
of highest frequency, allows reconstruction of band-limited functions. 
(See Cardinal theorem.) 

Serial correlation coefficients 

Ratios of the autocovariances to the variance of a process, ensemble, 
etc. 
Signal 


A time function desired as (potentially) carrying intelligence. 


“Signal” 

A function of continuous time, which may be either a signal, a noise, 
or a combination of both. (Contrasted with data, a function of discrete 
time.) 

Single function approach 

A mode of representing certain ensembles by the translations of a 
single time function (in single function terms). 
Smoothed function 

The result of weighted averaging of nearby values of the original © 
function. 
Smoothing | 


In the narrow sense, forming (continuous or discrete) moving linear 
combinations with unit total weight. 
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Smoothing and decimation procedure 


A procedure which may be regarded as the formation of discrete mov- 
ing linear combinations, followed by the omission of all but every kth 
such. (See 17 and B.17.) 


Spectrum (also power spectrum) 


An expression of the second moments of an ensemble, process, etc. (i) 
in terms of frequencies, (ii) in such a form as to diagonalize the effects 
on second moments of time-invariant linear transformations applied to 
the ensemble or process. (adjective: spectral). 


Spectrum, aliased 


Tor equally spaced data, the principal part of the aliased spectrum 
expresses contributions to the variance in terms of frequencies between 
zero and the Nyquist or folding frequency, all contributions from fre- 
quencies having the same principal alias and sign having been combined 
by addition. (The aliased spectrum repeats the principal part periodically 
with period 2fy . See 14.) 


Spectral density 


A value of a function (or the entire function) whose integral over any 
frequency interval represents the contribution to the variance from that 
frequency interval. 


Spectral density estimates 


Estimates of spectral density, termed raw when obtained from equi- 
spaced mean lagged products by cosine series transformation, refined 
when hanned or hammed from raw estimates or obtained by an equiva- 
lent process. (See B.13.) 


Spectral window 


A function of frequency expressing the contribution of the spectral 
density at each frequency to the average value of an estimate of 
(smoothed) spectral density. 


Stattonary (ensemble or random process) 


An ensemble of time functions (or random process) is stationary if 
any translation of the time origin leaves its statistical properties un- 
affected. 
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Superposition theorem 

A statement that the output of a linear device is the convolution of its 
input with its impulse response. (See B.2.) 
Temporally homogeneous 

Sometimes used in place of stationary, especially when speaking of 
stochastic processes. 
Transfer function 


The transfer function of a network or other linear device is a complex- 
valued function expressing the amplitude and phase changes suffered by 
cosinusoidal inputs in becoming outputs. (See A.5.) The square of the 
absolute value of the transfer function is the power transfer function, 
which expresses the factors by which spectral densities are changed as 
inputs become outputs. (See 4.) 


Transmission 

The coefficient with which power at a given frequency contributes to 
power at the (new) principal alias as a result of the application of a 
smoothing and decimation procedure. 
Transversal filtering 

Time domain filtering by forming linear combinations of lagged values, 
use of moving linear combinations for filtering. (See Kallmann” for the 
origin of this term.) 
Trend 

A systematic, smooth component of a time function (time series), as, 
for example, a linear function of time (a linear trend). 
True 

Often used to refer to average values over the ensemble, as contrasted 
with mean values over the observations. 
Universe 

A collection of objects (numbers, functions, etc.) with probabilities 
attached to relevant subcollections. 
Variance 


A quadratic measure of variability, the average squared deviation 
from the average. 
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White noise 


An ensemble whose spectral density is sensibly constant from zero 
frequency through the frequencies of interest (in equi-spaced situations, 
up to the folding or Nyquist frequency). (The values of equi-spaced 
white noise at different times are independent.) 


Window 


A function expressing, as a multiplicative factor, the tendency or 
possibility of the various values of some function to enter into some 
calculation or contribute to the average value of some quantity. (See 
data, lag, spectral, etc. for specific instances.) 


Windowless quadratic 


A quadratic expression is windowless if its average value vanishes 
for every stationary ensemble of finite variance (See B.19). 


Window pair 


Two windows related by a Fourier transformation, as lag and spectral 
windows or data and frequency windows. (See A.4 and 4.) 


Zero-frequency waves (cosine and sine) 


The limiting forms of very-low-frequency cosinusoids, namely con- 
stants and linear trends. (See 19.) 
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